Next Article in Journal
Estimation of PM2.5 Concentration Based on PSO-Optimized Machine Learning Models and SHAP Analysis: A Case Study of Wuhan, Hubei Province
Previous Article in Journal
An Identification Method for Vulnerable Bridges Based on the SCPR Model
Previous Article in Special Issue
Accuracy of Interproximal Space Measurement Across Different Orthodontic Tooth-Segmentation Programs: A Comparative Clinical Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Curve of Spee Leveling by Using Three Different Clear Aligner Systems: A Comparative Analysis

Department of Clinical Sciences and Translational Medicine, University of Rome Tor Vergata, 00133 Rome, Italy
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(13), 6321; https://doi.org/10.3390/app16136321 (registering DOI)
Submission received: 17 May 2026 / Revised: 14 June 2026 / Accepted: 19 June 2026 / Published: 24 June 2026
(This article belongs to the Special Issue Advances in Orthodontics and Dentofacial Orthopedics)

Abstract

Aim: To evaluate the leveling of the curve of Spee (COS) of the lower arch in deep-bite patients by using three different clear aligners: Invisalign® Align Technology® Inc. (Santa Clara, CA, USA), Spark™ Clear Aligner System Ormco™ (Brea, CA, USA) and Angel Aligner™ Technology Inc. (Shanghai, China). Material and Methods: Sixty-nine patients were selected based on specific criteria and subdivided in three different groups based on clear aligner systems used for their treatment: Invisalign® Aligner (I group), Spark™ Clear Aligner (S group), and Angel Aligner™ (A group). All patients were treated from 2021 to 2025 by the same orthodontist with extensive experience in the aligner technique, following a standardized protocol for deep-bite resolution. The STL files of the lower arch were analyzed using MeshLab software at three specific time points: pre-treatment (ID), post-treatment (AC), defined as the result achieved after the first set of aligners, and digital planning (EC). In all three groups, the differences between AC-ID and EC-AC were examined by means of the T-test; intergroup variations were also compared using the ANOVA test. Results: The study revealed that aligners were only minimally effective at leveling the COS. The planned dental movements were not fully achieved, leading to a low accuracy of the treatment. The highest mean accuracy was detected in the Angel Aligner™ group (39%). Conclusions: The clear aligner treatment for leveling the COS presents limits in terms of biomechanics of the device. The study shows that no clinically and statistically significant differences emerged across the three systems used.

1. Introduction

Deep-bite malocclusions are among the most common orthodontic problems and are characterized by a multifactorial etiology involving both skeletal and dentoalveolar components.
One of the most frequently associated dental features is an accentuated curve of Spee (COS), particularly in the mandibular arch, which contributes significantly to excessive overbite and occlusal disharmony [1]. The COS represents the anteroposterior curvature of the mandibular occlusal plane and plays an important role in masticatory function, force distribution, and occlusal balance. Its depth is influenced by the vertical position of anterior and posterior teeth. The assessment of the COS can be easily performed on either conventional plaster casts or digital models, and its leveling is considered a fundamental step in achieving functional occlusion, long-term stability, and proper intercuspation [1].
Recent evidence from a systematic review and meta-analysis further confirmed that leveling the curve of Spee represents a fundamental objective of orthodontic treatment, regardless of the appliance used, as it contributes to improved occlusal relationships, functional stability, and correction of deep-bite malocclusions. Nevertheless, considerable variability remains regarding the effectiveness and predictability of the different treatment modalities available for achieving this objective [2].
Biomechanically, leveling the COS can be achieved through different strategies such as intrusion of the anterior teeth, extrusion of the posterior teeth, or a combination of both, depending on the patient’s skeletal pattern and dentoalveolar characteristics. These movements aim to reduce excessive overbite, improve occlusal relationships, and establish a more harmonious and stable occlusal plane. Consequently, accurate evaluation and correction of the COS remain key objectives in the orthodontic management of deep-bite patients.
In recent years, clear aligner therapy has gained widespread acceptance among both clinicians and patients due to its aesthetic advantages, comfort, and improved oral hygiene maintenance. Nevertheless, despite the continuous technological advancements in aligner materials, software planning, and attachment design, the predictability of certain orthodontic movements remains a significant clinical concern. Among these, the correction of deep bite through mandibular COS leveling represents one of the most challenging treatment objectives.
Several investigations have demonstrated that clear aligners exhibit variable effectiveness in achieving planned tooth movements, suggesting that some movements are inherently less predictable than others. In particular, extrusion of posterior teeth, which often plays a crucial role in deep-bite correction, has consistently been reported as one of the least predictable movements in aligner therapy [3,4,5,6,7,8,9,10].
Such limitations may negatively affect the efficiency of treatment and increase the need for refinement stages or auxiliary mechanics.
The concept of treatment accuracy, defined as the degree of correspondence between the planned tooth movement and the clinically achieved outcome, has therefore become increasingly relevant in contemporary orthodontic research. Previous studies evaluating deep-bite correction with clear aligners have reported relatively low accuracy rates, with mean values ranging from approximately 30% to 40% when using Invisalign® [11,12].
Similarly, a more recent investigation demonstrated an average correction of only 33% of the planned deep-bite improvement, emphasizing the frequent necessity for additional finishing procedures to achieve the desired clinical outcome [13].
Recent systematic reviews have confirmed that clear aligners can effectively reduce deep overbite and level the curve of Spee; however, the predictability of these movements remains moderate, with clinically achieved outcomes frequently falling short of the digital treatment plan. Moreover, most of the currently available evidence is derived from studies investigating Invisalign® alone, while comparative data among different aligner systems remain largely unavailable [14].
Despite the growing body of evidence on aligner performance, the available literature remains limited regarding the specific predictability and accuracy of mandibular COS leveling [15,16,17,18].
Furthermore, the studies conducted so far have predominantly focused on a single aligner system, namely Invisalign®, thereby restricting the generalizability of their findings. Consequently, it remains unclear whether the observed limitations are attributable to aligner therapy as a treatment modality in general or whether they may be influenced by differences in aligner design, material properties, manufacturing processes, biomechanical protocols, or treatment-planning software specific to each commercial system. This represents an important gap in the current scientific literature. As the number of commercially available aligner systems continues to increase, clinicians are increasingly required to make evidence-based decisions regarding the most appropriate appliance for specific malocclusions and treatment objectives. However, comparative data evaluating the performance of different aligner brands in achieving the same orthodontic movement are still scarce.
The originality and clinical relevance of the present study lie precisely in this unexplored area. To the authors’ knowledge, no previous investigation has directly compared the accuracy of three major aligner systems in the correction of the mandibular curve of Spee within a homogeneous clinical setting. By evaluating Invisalign®, Spark™ and Angel Aligner™, this study aims not only to quantify the effectiveness of each system in leveling the mandibular COS but also to determine whether the choice of a specific aligner brand may significantly influence treatment outcomes.
Understanding potential differences among aligner systems could provide valuable information for clinical decision-making, improve treatment planning strategies, and contribute to a more realistic estimation of treatment outcomes. Ultimately, the findings of this research may help clinicians optimize deep-bite correction protocols and enhance the predictability of aligner-based orthodontic treatments.
Therefore, the aim of this study is to evaluate and compare the accuracy of Invisalign®, Spark™ and Angel Aligner™ in leveling the mandibular COS and to investigate whether the selection of a specific aligner system affects the achievement of the planned treatment objectives.
In the present study, effectiveness was defined as the actual amount of COS leveling achieved after treatment. Efficiency referred to the ability of the aligner system to accomplish the prescribed tooth movements. Accuracy (or predictability) was defined as the degree of correspondence between the achieved clinical outcome and the movement planned in the digital setup.

2. Material and Method

The present study was approved by the University of Rome Ethics Committee: reference 142.25 CET2.

2.1. Study Sample

This was a multicenter, retrospective observational study. Patients were selected, according to predefined inclusion and exclusion criteria, from three private orthodontic clinics based in Rome. The three clinics are managed within the same professional network, and patient recruitment, treatment planning, and clinical procedures were coordinated by the same lead clinician, ensuring consistency across all participating centers. All centers contributed equally to patient recruitment and data collection and followed identical treatment protocols and eligibility criteria throughout the study period.
Eligible patients were consecutively included in the overall study population. Group allocation reflected the type of aligner therapy documented in the existing clinical records.
Inclusion criteria were permanent dentition (excluding third molars), presence of deep bite, curve of Spee (COS) > 3 mm, Class I dental malocclusion, and a normodivergent skeletal pattern. Exclusion criteria included previous orthodontic treatment, periodontal disease, dental morphological anomalies, and mandibular asymmetry. All data were anonymized and standardized prior to analysis.
Patients were not prospectively allocated to a specific aligner system for the purposes of this study. Instead, they were identified from existing clinical records of subjects who had already completed treatment with one of the three aligner systems and met the eligibility criteria. The same selection criteria were applied to all groups to improve sample comparability and reduce potential selection bias.

2.2. Treatment Protocol

For deep-bite correction, a standardized protocol was performed: conventional attachments, bite ramps, aligner change every 10 days, 30 to 40 aligners per arch and overcorrection prescriptions about upper and lower incisor intrusion. No auxiliaries such as intermaxillary elastics, temporary anchorage devices (TADs), Power Ridges, or equivalent intrusive features were used. Interproximal reduction (IPR) of the teeth was performed where indicated by the operator according to the planned prescription, but nearly all cases showed a comparable amount of IPR.
Only conventional horizontal rectangular or occlusal beveled attachments were designed from the second molar to the canine. (Figure 1).
Recent evidence suggests that optimized attachments do not provide a clinically significant improvement over conventional attachments in the correction of deep overbite, with systematic reviews reporting inconsistent differences in treatment accuracy between the two designs [19,20].
Based on this evidence, conventional attachments are commonly used in routine clinical practice, and this same rationale guided their selection in the present study in order to ensure methodological consistency and sample homogeneity (Figure 2).

2.3. Measurement Protocol

Consents for both data collection and orthodontic treatment were obtained from all patients and/or their parents/tutors.
For each patient, the following diagnostic records were collected; specifically dental models of each subject were taken at 3 time points:
  • Pre-treatment digital casts (.stl files);
  • Post-treatment digital casts (.stl files);
  • The final stage of planned virtual setup digital casts (.stl files).
STL files were obtained by using intraoral scanner iTero® Orthodontic Element ver. 1.13.0.900 (Align Technology Inc., Santa Clara, CA, USA) and transferred to MeshLab software (Visual Computing Lab, Italy, 2023.12 version). All digital models and all linear measurements were used exclusively within MeshLab software, without the use or integration of any additional digital measurement tools or external software. Linear measurements were carried out using the built-in distance measurement (“Ruler”) tool of MeshLab, which allows the calculation of linear distances between selected anatomical landmarks on scaled STL files in millimetric units, provided that the original digital models are correctly preserved in real-size scale.
In each model the linear values were measured only of the lower arch as follows: firstly an occlusal plane (OP) of the mandibular arch was defined by drawing a line passing through three reference points, namely the distobuccal cusps of the second molars on the right and left sides and the midpoint of the central incisor edge [21] (Figure 3).
Perpendicular distances from the OP to the buccal and mesio-buccal cusp margins of each tooth, from the second molar to the canine on both sides, were evaluated. Measurements were obtained bilaterally by a first orthodontists and independently checked by a second orthodontists.
For each subject, the perpendicular distances correspond to three specific time points:
-
The initial distance (ID), referring to the vertical linear measurements obtained from the pre-treatment model;
-
The achieved change (AC), or results, representing the actual achieved outcomes, defined as all measurements performed on the post-treatment model, corresponding to the final position after the first series of aligners;
-
The expected change (EC), or results, referring to the values derived from the virtual planned model, corresponding to the ideal post-treatment outcome.

2.4. Statistical Analysis

Concerning statistics, measurements were initially obtained by a primary orthodontist and subsequently verified by a second examiner; thus inter-examiner reliability was assessed using the intraclass correlation coefficient (ICC), based on a two-way mixed-effects model with consistency type, across all centers and teeth, showing a good reliability (ICC range = 0.79–0.87). The agreement between right- and left-side measurements was also assessed using the ICC, showing a good agreement across variables (ICC range = 0.76–0.81). Given the good agreement, bilateral measurements were averaged to obtain a single representative value for each variable included in the statistical analysis.
The data regarding pre-treatment (ID), post-treatment results (AC), and expected change (EC) for the three groups were graphically explored by using the mean and standard error (SD) and are summarized in Table 1, which reports the median with the interquartile range (IQR).
Normality was assessed using graphical methods, and given the small sample size, the Shapiro–Wilk test was applied. To evaluate differences between AC and ID, and between EC and AC, a paired-samples t-test was performed. The significance level was set at 0.05, and p-values are reported in Table 2, Table 3 and Table 4. To account for multiple comparisons, a Bonferroni correction was applied; however, all results remained statistically significant after adjustment.
Differences between AC and ID, and between EC and AC, were calculated and expressed as mean values and standard deviations. To compare the three groups, one-way ANOVA was performed on the differences between AC and EC, with p-values reported in Table 5. Where a statistically significant difference was detected, post hoc pairwise comparisons were conducted using Tukey’s Honestly Significant Difference (HSD) test to adjust for multiple comparisons.
Data analysis was performed using Stata/IC v.15 (StataCorp, College Station, TX, USA) while graphs were constructed using Microsoft Excel v.2021.

3. Results

The study group included 69 patients (32 male and 37 female; mean age of 21y (SD = 2.6)), with 23 subjects for each type of treatment observed.
Baseline analysis showed statistically significant differences in initial severity among groups, particularly between Group A and the other groups. To account for potential confounding, baseline values were included as covariates in multivariable regression models. After adjustment, baseline was not significantly associated with the outcome, suggesting that it did not act as a major confounding factor in the observed group differences.
For the first group (Invisalign®), as shown in Table 2 and Figure 4, the greatest depth from the occlusal plane was observed in the first molars, with a mean distance of 2.80 mm (SD = 0.17) and a median of 2.80 mm (IQR= 2.70–2.95).
The results from the comparison between post-treatment (AC) and pre-treatment (ID) showed p-value ≤ 0.001, leading to the rejection of the null hypothesis according to which the two means were equal, showing a difference instead (Table 2).
Similarly, the comparison between planned (EC) and achieved (AC) results for every analyzed parameter showed p-value < 0.001, confirming that there were differences (Table 2).
It was noted that, in terms of mm, the largest differences between the post-treatment result and the virtual plan were found for the second premolars, with a mean value of 0.79 mm (SD = 0.19).
The highest accuracy (meant as the ability to accurately predict the result compared to the virtually planned objectives) was observed for the canines (52%), while the lowest value was noticed for the second molars (17%) (Table 6).
The results in Spark™ group (Table 3 and Figure 5) highlighted significant differences from 0 with a p-value < 0.001. The greatest depth from the occlusal plane, as shown in Table 1, was observed in the first molars, with a mean distance of 2.6 mm (SD = 0.19) and a median of 2.90 mm (IQR= 2.70–3.00). Specifically, the most relevant differences between AC and EC were recorded in the first premolars, with a mean value of 0.73 mm (SD = 0.11) (Table 3).
In terms of accuracy, the lowest differences were observed in the second molars (21%), while the highest ones were in the canines accounting for 54% (Table 6).
In the Angel Aligner™ group (Table 4 and Figure 6), the greatest depth from the occlusal plane was observed for the first molars, with a mean distance of 2.65 mm (SD = 0.1) and a median of 2.70 mm (IQR= 2.50–2.70).
All the results emerging from the two comparisons (AC with ID and EC with AC) showed p-value < 0.001. The largest differences between AC and EC were observed for the first premolars, with a mean value of 0.73 mm (SD = 0.03) (Table 4).
The lowest accuracy level was observed in the second molars (21%), while the highest one was noted in the canines at almost 58% (Table 6).
As shown by means of the ANOVA test (Table 5), the three groups compared by analyzing the delta between the planned and the obtained results did not present statistically significant differences.
An exception was observed for the canines, where one-way ANOVA revealed a statistically significant difference among groups (p < 0.001). Post hoc Tukey analysis demonstrated significant differences between Group A and Group S, as well as between Group A and Group I. For the first premolars, a significant difference was also detected (p = 0.020), with post hoc comparisons indicating differences between Group I and Group A, and between Group I and Group S. Finally, the first molars also showed a statistically significant difference (p = 0.047), specifically between Group I and Group A.
The study on accuracy (Table 6) did not show a high treatment level of precision. Overall, higher values were reported for the canines, whereas lower values were shown for second molars, with an overall mean slightly above 30% across the three groups.

4. Discussion

The aim of this study was to evaluate the COS correction by using aligners. Specifically, efficiency and effectiveness of the three brands of aligners were investigated.
To date, we know that no orthodontic appliance, whether fixed or removable, achieves 100% of the prescribed movements [22]. This limitation is particularly relevant in vertical control, where biomechanical expression is influenced by force magnitude, occlusal contacts, and anchorage balance.
In particular, COS leveling by using straight-wire technique reported 50% expression of the arches in 0.022-inch edgewise slots, strongly recommending an overcorrection strategy, such as reverse curves, to achieve higher clinical predictability [23].
A similar consideration may also apply to clear aligner therapy, where digital staging represents an idealized force system rather than the true biological outcome. Studies on predictability are helping clinicians determine whether overcorrection adjustments are necessary to actually achieve the desired movements. In this context, evidence from systematic reviews further supports the notion that the clinical expression of planned movements is often incomplete and highly variable. Papadimitriou et al. [24], in a systematic review on the clinical effectiveness of Invisalign® therapy, reported a moderate level of evidence with considerable heterogeneity among available studies, highlighting that predicted tooth movements are not fully achieved in clinical reality and that treatment outcomes are influenced by multiple biological and biomechanical factors.
Actually, aligners deliver intermittent and distributed forces that are highly dependent on material properties, fit, and the complex interaction between aligner and occlusal surfaces. Therefore, the discrepancy between virtual setup and achieved outcome is an expected phenomenon rather than an exception. Studies on predictability are therefore essential to better understand the real clinical effectiveness of programmed movements and to determine whether systematic overcorrections are required.
In the present study, the difference between achieved and planned COS leveling was analyzed across three aligner systems. Three samples of young adults in permanent dentition were considered: the mean depth of COS corresponded to 3.32 mm (SD = 0.17) for the Invisalign® group, 3.34 mm (SD = 0.18) for the Spark™ group, and 3.16 mm (SD = 0.14) for the Angel™ Group (A), while the median values were respectively 3.30 mm (IQR = 3.20–3.45), 3.40 mm (IQR = 3.20–3.50), and 3.20 mm (IQR = 3.05–3.20) (Table 1). These baseline similarities strengthen the comparability of the three cohorts and reduce potential bias related to initial severity.
Regarding measurements, strategic reference points were selected in order to include multiple dental elements and different vertical levels of the occlusal curvature, allowing a more accurate evaluation of every single dental movement.
The analyzed data of all three groups (Table 2, Table 3 and Table 4) underscored statistically significant differences between post and pre-treatment measurements (Δ AC-ID), in agreement with the previous literature, confirming that COS leveling occurs but not with full expression of the programmed movements. Specifically, the teeth that obtained a greater programmed correction across the three groups were the first premolars, whereas the molars extruded to a lesser extent, especially the second ones. These findings are consistent with the previous literature, indicating reduced predictability of vertical movements in posterior segments [15,25].
This reduced predictability may be explained by multiple biomechanical factors, including the inherent “bite-block effect” of aligners, high occlusal load concentration on molars, and limited vertical freedom within the aligner material [15]. According to a recent review of 26 years of evidence on deep-bite correction with aligners, posterior extrusion remains difficult to achieve due to the aligner itself acting as an occlusal splint, generating a bite-block effect that may counteract the programmed vertical movement. This phenomenon may contribute to the reduced effectiveness of curve of Spee leveling observed in the present sample [26].
Auxiliaries include inter-arch elastics from lower first molars or even the use of a stiffer material, with greater clearance between the inner surface of the aligner and the occlusal surface of the tooth; these are all factors that could promote a greater extrusive force vector on the tooth [15].
In contrast, higher values of movement have been reported by Khosravi et al. [23]. This discrepancy is likely attributable to a different methodological approach, as that study evaluated outcomes over multiple sets of additional aligners and at the end of the therapy and so there was more treatment time and a greater number of aligners to express the planned movements. Therefore, extrusion effectiveness may increase when assessed at the end of comprehensive treatment rather than only at the end of the first set of aligners, as performed in our study.
The present results also showed that the least pronounced movements were those of the second molars, in agreement with previous findings in the literature [10]. This may be related to reduced aligner adaptation, decreased control in the most distal segments, and biomechanical dissipation of forces along the arch length.
When comparing achieved versus digitally planned movements (Δ AC-EC), statistically significant differences were again observed across all three groups, with the greatest discrepancy at the premolar level. These findings confirm that a perfect correspondence between virtual planning and clinical outcome is not achieved. The discrepancy observed between planned and achieved outcomes is consistent with previous evidence indicating that virtual treatment setups tend to overestimate the actual expression of programmed tooth movements. Grünheid et al. [27] demonstrated that final tooth positions frequently differ from those predicted by the digital setup, emphasizing the biological and biomechanical factors that limit the complete realization of planned movements during aligner therapy.
Consequently, the larger the discrepancy between planned and obtained movement, the lower the predictability of the system. This highlights the importance of understanding not only whether a movement occurs, but also how closely it follows the digital prescription.
Overall, higher values were reported for the canines, while lower ones were recorded for the second molars. The first premolars generally performed slightly better than the second premolars, while the first molars turned out better than second ones, evidencing a difference from the previous literature reporting better performance in second molars (up to 52% accuracy) [15]. Nevertheless, the overall trend remains consistent with the concept that posterior vertical movements are among the least predictable in aligner therapy. Haouili et al. [9] similarly reported an average molar extrusion accuracy of approximately 40%, identifying these as one of the least predictable movements.
For the purposes of completeness, an intergroup comparison test was also performed regarding treatment predictability (Table 6). The statistical evidence showed that there was no significant difference between the three systems used. In addition to the individual dental elements, to explain the effectiveness and predictability of the treatment more clearly, the leveling of the COS was also analyzed. Overall treatment accuracy analysis (Table 6) revealed relatively low predictability, with an average slightly above 30%. Among the three systems, Angel Aligner™ demonstrated higher overall accuracy (36%), which may reflect differences in aligner material properties, force delivery consistency, or the interaction between staging protocols and biomechanical response. However, intergroup comparison did not reveal statistically significant differences, suggesting that despite small numerical variations, the three systems behave similarly in terms of overall predictability.
From a biomechanical perspective, it is important to emphasize that each orthodontic movement is composed of multiple components acting simultaneously in three dimensions. As previously suggested by Castroflorio et al. [10], isolating a single movement inevitably simplifies a highly complex reality, where force vectors, occlusal contacts, and material deformation continuously interact. Moreover, the final phase of the digital setup represents an idealized endpoint that does not fully account for biological variability and intraoral environmental factors.
In this study, the overall leveling of the COS showed limited predictability in all three groups. It should be considered that the included patients presented severe deep bite with a COS > 3 mm at baseline and were treated without auxiliary devices, and outcomes were assessed at the end of the first set of aligners without refinements. These factors likely contributed to the relatively lower predictability values compared to studies including refinements or adjunctive mechanics.
The present study has several limitations that should be acknowledged. First, its retrospective design and the inherent lack of randomization may introduce selection bias; however, strict inclusion and exclusion criteria were applied to minimize heterogeneity and improve sample consistency. Although the study was conducted across three private orthodontic clinics, potential operator-related variability in clinical execution and decision-making was minimized, as the main clinician responsible for patient recruitment, treatment planning, and protocol standardization was the same across all centers. This contributed to maintaining consistency in case selection and treatment procedures throughout the study. Standardized protocols for data collection and treatment execution were additionally applied in all participating clinics to further reduce inter-operator variability. Moreover, the use of digital STL models contributed to enhancing the reproducibility and reliability of the measurement protocol, allowing standardized landmark identification and consistent assessment of treatment outcomes across all subjects.
It should also be acknowledged that alternative attachment designs, including optimized or individualized configurations, may influence force distribution and the biomechanical expression of specific tooth movements, particularly in the vertical dimension. Although conventional attachments were used in the present study to ensure methodological consistency and sample homogeneity, different attachment designs may potentially modify the efficiency of extrusion mechanics and therefore deserve further investigation to improve the predictability of complex movements such as curve of Spee leveling and deep-bite correction.
Another relevant limitation is the absence of direct monitoring of patient compliance, including actual daily aligner wear time, which remains a known confounding factor in aligner-based therapy. However, adherence was routinely assessed during monthly follow-up visits through clinical evaluation, including assessment of aligner fit, degree of wear, and overall adaptation on the dental arches. Although these methods provide a clinically useful indication of compliance, it should be acknowledged that they remain indirect and partially subjective and therefore cannot fully replace objective monitoring systems.
Finally, a potential limitation of the present study is the relatively small sample size. Despite the aforementioned limitations, the multicenter design, standardized protocols, and homogeneous sample represent strengths of the study and contribute to reducing potential bias. In addition, it should be emphasized that the inclusion of three different aligner systems within the same methodological framework represents a distinctive strength compared to previously published studies, allowing a more direct comparison of system-related performance under standardized conditions.
Overall, the present findings contribute to a more realistic understanding of the biomechanical behavior of clear aligners in the vertical dimension, particularly in the context of deep-bite correction and COS leveling. These results may assist clinicians in setting more evidence-based expectations during treatment planning, especially regarding the need for potential overcorrection strategies in complex vertical discrepancies. Further prospective studies with larger samples and inclusion of refinements or adjunctive mechanics are warranted to better refine the predictability of vertical movements in aligner therapy.

5. Conclusions

Treatment with clear aligners for leveling the COS has shown limitations related to biomechanics.
Although Angel Aligner™ showed a tendency toward better performance, the differences among the three aligner systems were neither statistically nor clinically significant.
The results of this study suggest that mandibular COS leveling remains a challenging treatment objective when performed exclusively with clear aligners. Future studies should investigate whether alternative biomechanical approaches, adjunctive treatment strategies, or alternative digital setup protocols may influence the predictability of this movement. For example, hybrid protocols combining aligner therapy with other orthodontic mechanics may represent a promising area for further research, although such approaches were not evaluated in the present study.

Author Contributions

Conceptualization, methodology and writing original draft, G.M.; software, data curation, V.C.; data curation, M.C.; writing review and supervision, A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. El-Dawlatly, M.M.; Fayed, M.M.S.; Mostafa, Y.A. Deep overbite malocclusion: Analysis of the underlying components. Am. J. Orthod. Dentofac. Orthop. 2012, 142, 473–480. [Google Scholar] [CrossRef]
  2. Francisco, I.; Pinto, A.L.; Nunes, C.; Prata Ribeiro, M.; Caramelo, F.; Marto, C.M.; Paula, A.B.; Travassos, R.; Vale, F. Effectiveness of Orthodontic Methods for Leveling the Curve of Spee: A Systematic Review with Meta-Analysis. Appl. Sci. 2025, 15, 12217. [Google Scholar] [CrossRef]
  3. Kravitz, N.D.; Kusnoto, B.; BeGole, E.; Obrez, A.; Agran, B. How well does Invisalign work? A prospective clinical study evaluating the efficacy of tooth movement with Invisalign. Am. J. Orthod. Dentofac. Orthop. 2009, 135, 27–35. [Google Scholar] [CrossRef]
  4. Rossini, G.; Parrini, S.; Castroflorio, T.; Deregibus, A.; Debernardi, C.L. Efficacy of clear aligners in controlling orthodontic tooth movement: A systematic review. Angle Orthod. 2015, 85, 881–889. [Google Scholar] [CrossRef] [PubMed]
  5. Houle, J.P.; Piedade, L.; Todescan, R.; Pinheiro, F.H. The predictability of transverse changes with Invisalign. Angle Orthod. 2017, 87, 19–24. [Google Scholar] [CrossRef] [PubMed]
  6. Zheng, M.; Liu, R.; Ni, Z.; Yu, Z. Efficiency, effectiveness and treatment stability of clear aligners: A systematic review and meta-analysis. Orthod. Craniofac. Res. 2017, 20, 127–133. [Google Scholar] [PubMed]
  7. Galan-Lopez, L.; Barcia-Gonzalez, J.; Plasencia, E. A systematic review of the accuracy and efficiency of dental movements with Invisalign®. Korean J. Orthod. 2019, 49, 140–149. [Google Scholar] [CrossRef] [PubMed]
  8. Robertson, L.; Kaur, H.; Fagundes, N.C.; Romanyk, D.; Major, P.; Flores-Mir, C. Effectiveness of clear aligner therapy for orthodontic treatment: A systematic review. Orthod. Craniofac. Res. 2020, 23, 133–142. [Google Scholar] [PubMed]
  9. Haouili, N.; Kravitz, N.D.; Vaid, N.R.; Ferguson, D.J.; Makki, L. Has Invisalign improved? A prospective follow-up study on the efficacy of tooth movement with Invisalign. Am. J. Orthod. Dentofac. Orthop. 2020, 158, 420–425. [Google Scholar] [CrossRef]
  10. Castroflorio, T.; Sedran, A.; Parrini, S.; Garino, F.; Reverdito, M.; Capuozzo, R.; Mutinelli, S.; Grybauskas, S.; Vaitiekūnas, M.; Deregibus, A. Predictability of orthodontic tooth movement with aligners: Effect of treatment design. Prog. Orthod. 2023, 24, 2. [Google Scholar] [CrossRef] [PubMed]
  11. Fontaine-Sylvestre, C. Predictability of Deep Overbite Correction Using Invisalign®. Ph.D. Thesis, University of Manitoba, Winnipeg, MB, Canada, 2019. [Google Scholar]
  12. Blundell, H.L.; Weir, T.; Kerr, B.; Freer, E. Predictability of overbite control with the Invisalign appliance. Am. J. Orthod. Dentofac. Orthop. 2021, 160, 725–731. [Google Scholar] [CrossRef]
  13. Shahabuddin, N.; Kang, J.; Jeon, H.H. Predictability of the deep overbite correction using clear aligners. Am. J. Orthod. Dentofac. Orthop. 2023, 163, 793–801. [Google Scholar] [CrossRef]
  14. Husain, F.; Riccardi, J.; Alletto, L.; Stellrecht, E.; Poursattar Bejehmir, A.; Al-Jewair, T. Effectiveness and Accuracy of Clear Aligners in Treatment of Deep Bite: A Systematic Review. Clin. Oral Investig. 2025, 29, 306. [Google Scholar] [CrossRef] [PubMed]
  15. Goh, S.; Dreyer, C.; Weir, T. The predictability of the mandibular curve of Spee leveling with the Invisalign appliance. Am. J. Orthod. Dentofac. Orthop. 2022, 162, 193–200. [Google Scholar] [CrossRef]
  16. Rozzi, M.; Tiberti, G.; Mucedero, M.; Cozza, P. Leveling the curve of Spee: Comparison between continuous archwire treatment and Invisalign system: A retrospective study. Am. J. Orthod. Dentofac. Orthop. 2022, 162, 645–655. [Google Scholar] [CrossRef]
  17. Ciavarella, D.; Fanelli, C.; Suriano, C.; Campobasso, A.; Lorusso, M.; Ferrara, D.; Maci, M.; Esposito, R.; Tepedino, M. Curve of Spee modification in different vertical skeletal patterns after clear aligner therapy: A 3D set-up retrospective study. Prog. Orthod. 2024, 25, 5. [Google Scholar] [CrossRef] [PubMed]
  18. Dianiskova, S.; Rongo, R.; Sirignano, D.; D’Amelio, R.; Oliva, G.; Valletta, R.; D’Antò, V. Assessment of clinical predictability of overbite reduction and curve of Spee leveling in clear aligner treatment. Orthod. Craniofac. Res. 2025, 28, 458–465. [Google Scholar] [CrossRef] [PubMed]
  19. Burashed, H.; Sebai, R.E. Quantifying the efficacy of overbite reduction in patients treated with clear aligners using optimized versus conventional attachments. J. World Fed. Orthod. 2023, 12, 105–111. [Google Scholar] [CrossRef] [PubMed]
  20. Muthuswamy Pandian, S.; Subramanian, A.K.; Vaiid, N. Comparison of efficacy and accuracy of tooth movements in optimized and conventional attachments of clear aligners: A systematic review and meta-analysis. J. Oral Biol. Craniofac. Res. 2025, 15, 1123–1133. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  21. Braun, S.; Hnat, W.P.; Johnson, B.E. The curve of Spee revisited. Am. J. Orthod. Dentofac. Orthop. 1996, 110, 206–210. [Google Scholar] [CrossRef]
  22. Pandis, N.; Polychronopoulou, A.; Sifakakis, I.; Makou, M.; Eliades, T. Effects of levelling of the curve of Spee on the proclination of mandibular incisors and expansion of dental arches: A prospective clinical trial. Aust. Dent. J. 2010, 26, 61–65. [Google Scholar] [CrossRef]
  23. Khosravi, R.; Cohanim, B.; Hujoel, P.; Daher, S.; Neal, M.; Liu, W.; Huang, G. Management of overbite with the Invisalign appliance. Am. J. Orthod. Dentofac. Orthop. 2017, 151, 691–699. [Google Scholar] [CrossRef]
  24. Papadimitriou, A.; Mousoulea, S.; Gkantidis, N.; Kloukos, D. Clinical effectiveness of Invisalign® orthodontic treatment: A systematic review. Prog. Orthod. 2018, 19, 37. [Google Scholar] [CrossRef] [PubMed]
  25. Simon, M.; Keilig, L.; Schwarze, J.; Jung, B.A.; Bourauel, C. Treatment Outcome and Efficacy of an Aligner Technique—Regarding Incisor Torque, Premolar Derotation and Molar Distalization. BMC Oral Health 2014, 14, 68. [Google Scholar] [CrossRef] [PubMed]
  26. Blundell, H.L. Deep Bite Correction with Clear Aligners—After 26 Years Where Are We At? Semin. Orthod. 2026, 32, 143–147. [Google Scholar] [CrossRef]
  27. Grünheid, T.; Loh, C.; Larson, B.E. How Accurate Is Invisalign in Nonextraction Cases? Are Predicted Tooth Positions Achieved? Angle Orthod. 2017, 87, 809–815. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (Right): Pronounced curve of Spee (COS); (left): leveling of the COS. Upward arrows indicate extrusion of the posterior teeth, whereas the downward arrow indicates intrusion of the anterior teeth.
Figure 1. (Right): Pronounced curve of Spee (COS); (left): leveling of the COS. Upward arrows indicate extrusion of the posterior teeth, whereas the downward arrow indicates intrusion of the anterior teeth.
Applsci 16 06321 g001
Figure 2. Conventional horizontal rectangular or beveled occlusal attachments from the first molar to the canine.
Figure 2. Conventional horizontal rectangular or beveled occlusal attachments from the first molar to the canine.
Applsci 16 06321 g002
Figure 3. Occlusal plane (OP) and relative perpendicular distances of each tooth from the OP.
Figure 3. Occlusal plane (OP) and relative perpendicular distances of each tooth from the OP.
Applsci 16 06321 g003
Figure 4. Comparison of extrusive movements for each tooth during the Invisalign® treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Figure 4. Comparison of extrusive movements for each tooth during the Invisalign® treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Applsci 16 06321 g004
Figure 5. Comparison of extrusive movements for each tooth during the Spark™ treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Figure 5. Comparison of extrusive movements for each tooth during the Spark™ treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Applsci 16 06321 g005
Figure 6. Comparison of extrusive movements for each tooth during the Angel™ treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Figure 6. Comparison of extrusive movements for each tooth during the Angel™ treatment and the standard deviation. Note: All measurements are in mm. ID = initial distance; AC = achieved change; EC = expected change.
Applsci 16 06321 g006
Table 1. Descriptive statistics of the linear measurements of each group from OP.
Table 1. Descriptive statistics of the linear measurements of each group from OP.
n = 23IDID
Mean (SD)Median (IQR)
GROUP INVISALIGN
CANINES1.009 (0.181)1.000 (0.900–1.150)
FIRST PREM1.713 (0.174)1.700 (1.600–1.850)
SEC PREM2.313 (0.169)2.300 (2.200–2.450)
FIRST MOL2.804 (0.172)2.800 (2.700–2.950)
SEC MOL1.813 (0.179)1.800 (1.700–1.950)
COS3.322 (0.168)3.300 (3.200–3.450)
GROUP SPARK
CANINES1.030 (0.184)1.100 (0.900–1.200)
FIRST PREM1.730 (0.187)1.800 (1.600–1.900)
SEC PREM2.330 (0.177)2.400 (2.200–2.450)
FIRST MOL2.600 (0.189)2.900 (2.700–3.000)
SEC MOL1.835 (0.177)1.800 (1.700–2.000)
COS3.339 (0.178)3.400 (3.200–3.500)
GROUP ANGEL
CANINES0.857 (0.144)0.900 (0.750–0.900)
FIRST PREM1.565 (0.140)1.600 (1.450–1.650)
SEC PREM2.165 (0.137)2.200 (2.050–2.200)
FIRST MOL2.652 (0.147)2.700 (2.500–2.700)
SEC MOL1.657 (0.134)1.700 (1.550–1.700)
COS3.161 (0.137)3.200 (3.050–3.200)
Note: All measurements are in millimeters (mm). OP = occlusal plane. ID = initial distance.
Table 2. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Invisalign® group.
Table 2. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Invisalign® group.
Group
Invisalign
T Test
ID e AC
Δ AC-IDΔ EC-IDT Test EC e ACΔ AC-ECAccuracy
p-ValueMean (SD)Mean (SD)p-ValueMean (SD)%
CANINES<0.001−0.23 (0.05)−0.44 (0.05)<0.0010.21 (0.01)51.8%
FIRST PREM<0.001−0.46 (0.04)−1.24 (0.05)<0.0010.78 (0.02)37.1%
SEC PREM<0.001−0.37 (0.08)−1.15 (0.17)<0.0010.79 (0.19)32.6%
FIRST MOL<0.001−0.25 (0.05)−0.86 (0.10)<0.0010.61 (0.09)29.1%
SEC MOL<0.001−0.09 (0.04)−0.53 (0.12)<0.0010.45 (0.12)17.0%
COS<0.001−0.26 (0.02)−0.89 (0.10)<0.0010.63 (0.09)29.8%
Note: All measurements are in millimeters (mm). Positive numbers indicate an intrusive movement relative to the OP; negative numbers indicate an extrusive movement relative to the OP. Note 2: p-values are uncorrected. All results remained significant after Bonferroni correction (α = 0.00417).
Table 3. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Spark™ group.
Table 3. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Spark™ group.
Group: SparkT Test
ID e AC
Δ AC-IDΔ EC-IDT Test EC e ACΔ AC-ECAccuracy
p-ValueMean (SD)Mean (SD)p-ValueMean (SD)%
CANINES<0.001−0.24 (0.05)−0.44 (0.04)<0.0010.20 (0.01)54.1%
FIRST PREM<0.001−0.46 (0.03)−1.19 (0.11)<0.0010.73 (0.11)38.9%
SEC PREM<0.001−0.36 (0.04)−1.07 (0.08)<0.0010.71 (0.09)34.0%
FIRST MOL<0.001−0.23 (0.05)−0.81 (0.12)<0.0010.58 (0.10)28.7%
SEC MOL<0.001−0.11 (0.02)−0.56 (0.11)<0.0010.45 (0.12)20.8%
COS<0.001−0.24 (0.02)−0.82 (0.11)<0.0010.58 (0.10)29.8%
Note: All measurements are in millimeters (mm). Positive numbers indicate an intrusive movement relative to the OP; negative numbers indicate an extrusive movement relative to the OP. Note 2: p-values are uncorrected. All results remained significant after Bonferroni correction (α = 0.00417).
Table 4. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Angel™ group.
Table 4. Descriptive statistics of the delta values and p-values related to the T-tests with the calculation of accuracy in the Angel™ group.
Group
Angel
T Test
ID e AC
Δ AC-IDΔ EC-IDT Test
EC e AC
Δ AC-ECAccuracy
p-ValueMean (SD)Mean (SD)p-ValueMean (SD)%
CANINES<0.001−0.26 (0.03)−0.44 (0.02)<0.0010.19 (0.02)57.7%
FIRST PREM<0.001−0.49 (0.02)−1.22 (0.04)<0.0010.73 (0.03)40.0%
SEC PREM<0.001−0.40 (0.03)−1.11 (0.13)<0.0010.71 (0.12)36.5%
FIRST MOL<0.001−0.26 (0.05)−0.82 (0.11)<0.0010.54 (0.10)31.6%
SEC MOL<0.001−0.10 (0.05)−0.49 (0.06)<0.0010.38 (0.08)21.4%
COS<0.001−0.26 (0.05)−0.82 (0.11)<0.0010.57 (0.11)31.6%
Note: All measurements are in millimeters (mm). Positive numbers indicate an intrusive movement relative to the OP; negative numbers indicate an extrusive movement relative to the OP. Note 2: p-values are uncorrected. All results remained significant after Bonferroni correction (α = 0.00417).
Table 5. Descriptive statistics of the delta values and p-values related to the ANOVA test in the comparison between the three groups.
Table 5. Descriptive statistics of the delta values and p-values related to the ANOVA test in the comparison between the three groups.
Group IGroup SGroup AAnova
Δ AC-ECΔ AC-ECΔ AC-ECΔ AC-EC
Median (IQR)Median (IQR)Median (IQR)p-Value
CANINES0.29 (0.28–0.31)0.29 (0.28–0.30)0.27 (0.26–0.28)<0.001
FIRST PREM0.91 (0.89–0.92)0.87 (0.85–0.91)0.87 (0.83–0.89)0.0193
SEC PREM0.91 (0.69–1.00)0.80 (0.73–0.85)0.81 (0.73–0.88)0.1014
FIRST MOL0.68 (0.62–0.72)0.63 (0.57–0.70)0.59 (0.54–0.66)0.0477
SEC MOL0.46 (0.39–0.52)0.48 (0.41–0.56)0.39 (0.36–0.43)0.1897
COS0.69 (0.63–0.77)0.69 (0.59–0.74)0.60 (0.56–0.66)0.0807
Note: All measurements are in millimeters (mm). Positive numbers indicate an intrusive movement relative to the OP; negative numbers indicate an extrusive movement relative to the OP.
Table 6. Accuracy of linear measurements.
Table 6. Accuracy of linear measurements.
Accuracy
Group IGroup SGroup A
CANINES51.8%54.1%57.7%
FIRST PREM37.1%38.9%40.0%
SEC PREM32.6%34.0%36.5%
FIRST MOL29.1%28.7%31.6%
SEC MOL17.0%20.8%21.4%
COS29.8%29.8%31.6%
MEAN32.9%34.4%36.5%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mampieri, G.; Cantiero, V.; Carillo, M.; Giancotti, A. Curve of Spee Leveling by Using Three Different Clear Aligner Systems: A Comparative Analysis. Appl. Sci. 2026, 16, 6321. https://doi.org/10.3390/app16136321

AMA Style

Mampieri G, Cantiero V, Carillo M, Giancotti A. Curve of Spee Leveling by Using Three Different Clear Aligner Systems: A Comparative Analysis. Applied Sciences. 2026; 16(13):6321. https://doi.org/10.3390/app16136321

Chicago/Turabian Style

Mampieri, Gianluca, Valeria Cantiero, Martina Carillo, and Aldo Giancotti. 2026. "Curve of Spee Leveling by Using Three Different Clear Aligner Systems: A Comparative Analysis" Applied Sciences 16, no. 13: 6321. https://doi.org/10.3390/app16136321

APA Style

Mampieri, G., Cantiero, V., Carillo, M., & Giancotti, A. (2026). Curve of Spee Leveling by Using Three Different Clear Aligner Systems: A Comparative Analysis. Applied Sciences, 16(13), 6321. https://doi.org/10.3390/app16136321

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop