1. Introduction
Establishing a surgical channel is a critical step in acoustic neuroma surgery. During this procedure, a portion of the skull must be removed via milling or drilling to expose the surgical target for subsequent interventions [
1]. The morphology of the removed skull area significantly impacts the surgeon’s ability to access the target region effectively in later stages of the operation. Additionally, the proximity of the milling site to the surrounding structures and the volume of bone to be excised are crucial factors in assessing potential postoperative complications. Consequently, preoperative planning of milling areas enables surgeons to gain a comprehensive understanding of the surgical procedure and assess the feasibility of the intervention, thereby promoting successful operations with minimal risk and potential harm [
2].
A surgical channel refers to the space created by milling part of the temporal bone, thereby exposing intracranial lesions. Robotic surgery for mastoidectomy has been developed to enhance the stability and precision in creating surgical channels [
3,
4,
5,
6]. Additional surgical navigation techniques, such as image-guided surgery and augmented reality navigation, can further assist with the bone milling procedure [
7,
8,
9,
10]. For these navigation techniques, preoperative milling area planning is increasingly crucial, serving as a spatial reference to guide milling operations. However, planning such a volumetric area is challenging due to the complexity of its spatial representation and the constraints imposed by the surrounding structures. For instance, the translabyrinthine approach is a transpetrosal presigmoid surgical route used for operative treatment. It is employed to quickly access the cochleovestibular nerve and treat tumors located in the lateral skull base [
11]. In this approach, the surgical channel must navigate around critical vessels and nerves [
1,
12], damage to which could lead to severe complications such as cerebrospinal fluid leaks and facial nerve dysfunction [
13,
14]. The current automated planning algorithms under such constraints predominantly address cochlear implants [
15] and stereoelectroencephalography (SEEG) [
16], where only a linear trajectory is required, leaving the planning of complex volumetric areas an unresolved issue. Practically, preoperative planning of milling areas largely depends on manual planning by surgeons, which takes 35 minutes on average [
17] and often lacks comprehensive consideration of the constructability of the planned milling area and the subsequent surgical procedures [
4,
5], limiting the effectiveness of preoperative planning in acoustic neuroma surgeries.
To address the aforementioned challenges, this study proposes a set of methods for modeling volumetric milling areas and the surrounding spatial constraints to enable automatic optimization. We aim to achieve automated planning for milling areas while ensuring the safety, constructability, and accessibility of the surgical targets in subsequent operations. We first employ high-resolution local segmentation masks of the key surrounding structures and deformable registration to construct the spatial distribution of critical structures, serving as constraints for subsequent planning algorithms. By combining the spatial constraints imposed by key structures with the physical dimensions of the surgical tools, the maximum milling area is generated as the initial input for optimization. Subsequently, a series of deformation fields are designed to control the volumetric milling area, using a surgical target accessing procedure as the foundational model. This method simplifies the complex three-dimensional volumetric area into a set of continuous parameters, while the boundaries formed by adjacent structures are converted into numerical constraints on these parameters. Finally, target accessibility, potential risk of injury, and the constructability of the surgical channel are quantitatively evaluated, with an evolutionary multi-objective optimization algorithm applied to automatically generate a surgical plan with optimal safety and feasibility. In detail, our method can generate a surgical plan in 5840.1 ± 279.9 s, reducing potential damage to the scala vestibuli by 29.8%, improving the milling boundary smoothness by 78.3%, and increasing target accessibility by 26.4%.
This paper makes the following main contributions:
A set of spatial modeling methods for parameterizing irregular volumetric milling areas;
Evaluation methods for milling areas that are critical to the constructability of the surgical channel;
The first automated planning algorithm which enables volumetric spatial planning within a feasible amount of time.
2. Related Work
Currently, most surgical planning studies in neurosurgery emphasize trajectory planning, such as electrode insertion for deep brain stimulation [
18] and inner ear access for cochlear implantation [
15,
19,
20]. In these cases, the planning target can be simply represented by an entry point on the skull and a target point within the brain. The representation limits the target definition space and allows each trajectory to be straightforwardly evaluated, thereby facilitating the application of optimization algorithms to these planning problems. In contrast, defining an irregular volumetric area presents a far more complex challenge. The inability to accurately model the planning target not only makes the problem unsuitable for automatic optimization but also complicates the evaluation of the planned surgical channel. Therefore, the current research primarily employs heuristic methods to generate milling area plans, avoiding the need to explicitly parameterize an irregular volume. Popovic et al. [
21] simplified this problem by neglecting the thickness of the skull, allowing them to project the three-dimensional contours of a tumor directly onto the adjacent skull surface to define the resection area. However, this method is unsuitable for temporal regions, where the bone thickness varies considerably, while critical structures, such as the vessels and nerves, are present. In a different approach, McBrayer et al. [
17] pre-planned surgical channels on an atlas with segmented critical structures. After that, deformable volume registration was applied to map the surgical plan from the atlas onto the new CT data. However, this method overlooks how the planned milling area impacts the constructability of the surgical channel and subsequent surgical procedures. Although the milling area may be well planned in the atlas, the deformation process can reduce its feasibility in real-world operations. Aghdasi et al. [
22] and Rajesh et al. [
2] enhance this method by creating multiple plans on the atlas. Unlike McBrayer et al. [
17]’s study, segmentation of the critical structures was also performed on the new CT data, with each milling plan evaluated in relation to the surrounding structures to identify the least risky option. Nonetheless, these approaches still lack personalized consideration of the surgical targets and a quantitative assessment of the plan’s feasibility.
3. Methods
3.1. Problem Definition
Planning target: As illustrated in
Figure 1, surgeons must remove part of the skull
to expose the surgical target
during acoustic neuroma surgery. This removal creates the surgical channel
, composed of the milled skull, the internal cavities
, and external space
, defined as
.
Spatial constraints: While the skull forms the definition space for milling operations, the surrounding critical structures impose certain limits on this area. Based on their clinical significance, the surrounding structures can be categorized into those that must remain intact , such as the sinus dura and the facial nerve, and those that should be minimally damaged , such as the scala vestibuli.
Evaluation: During the planning of the milling area, the potential risks relevant to the design of the area need to be considered. Key evaluations include the difficulty of constructing the planned surgical channel , how difficult it is for the surgical tools to reach the surgery targets after bone removal , and how much injury to the structures is caused by the milling procedure .
The comprehensive planning problem can thus be formulated as optimizing the milling area while ensuring it does not intersect with critical structures , aiming to minimize surgical risks .
3.2. The Automated Milling Area Planning Pipeline
However, the extremely high complexity of defining volumetric milling areas can result in impractical optimization times. To address this, we propose the framework illustrated in
Figure 2 for automated planning. Initially, a template-based approach was employed to segment the high-resolution risk structures
and
surrounding the milling areas. Subsequently, potential entry points on the skull surface
and target points
on the surgical target were identified.
For each target , the maximum feasible milling area that contributes to the surgical procedure was calculated, adhering to the constraints of critical structures . This process generated a maximum potential milling region, which served as the boundary for optimization: .
To control these volumetric areas, a set of deformation control functions
was designed. By projecting each
onto a control plane and applying
, the volumetric milling area could be modulated using the parameters defined by these functions. This approach inherently transformed the complex spatial constraints into numerical boundaries for parameter adjustments. Lastly, we evaluated patient injury
, constructability
, and target accessibility
for the surgical channel under the parameter set
. Multi-objective optimization was then employed to identify the most suitable surgical plan.
The following part of this paper details the methods proposed in this paper. In detail,
Section 3.3 includes the method for critical structure segmentation,
Section 3.4.1 includes the method for generating the maximum possible milling area,
Section 3.4.2 includes the method for volumetric milling area parameterization, and
Section 3.4.3 includes the milling area evaluations and the implementation of multi-objective optimization, while
Section 3.5 includes a set of methods for computing acceleration to enable automatic planning within a reasonable amount of time.
3.3. Critical Structure Segmentation
Segmentation of the critical structures, such as the sinus dura, the facial nerve, and the scala tympani, surrounding the milling area in preoperative CT scans is essential for establishing the spatial constraints that inform the milling area’s design. However, this segmentation task is challenging due to several factors: the dimensions of certain critical structures approach the resolution limit of clinical CT devices (0.625 mm), soft tissues can be difficult to discern in CT images, and there is considerable variation in the sizes of different critical structures. These complexities pose significant challenges for deep-learning-based segmentation methods [
23,
24,
25].
Therefore, we employed the template-based segmentation pipeline illustrated in
Figure 3 to achieve robust segmentation of the critical structures beyond the imaging resolution limitations. Specifically, two cranial CT templates were developed (see
Figure 4): one encompassing the full cranial area,
, at a resolution similar to that used in standard clinical practice (0.625 mm × 0.625 mm × 0.625 mm), and another covering only the local region,
, around the lateral skull base area with a finer spatial resolution (0.125 mm × 0.125 mm × 0.125 mm). The full cranial template
served to establish the spatial connection between the target CT volumes and the high-resolution local template. To enhance the generalizability of template
,
was generated by registering and averaging multiple clinical CT volumes using the deformable registration method proposed in Avants et al. [
26]’s study. Mutual information (MI) was employed as the optimization metric during the registration process. Concurrently, the local template
was designed to provide a precise reference for accurate segmentation. To accomplish this, we utilized the high-resolution temporal bone segmentation dataset from Sieber et al. [
27]’s study, averaging it to form
. Moreover, to connect these two different spaces, anatomical landmarks, including the stylomastoid foramen, the geniculate ganglion, the top of the head of the malleus, the footplate of the stapes, and the arcuate eminence, were annotated on both templates, represented as
and
.
For segmentation of a new CT volume , it was first aligned with the full-head template through deformable registration to transfer the anatomical landmarks into the new volume space, namely . Subsequently, rigid body registration between and was performed using Singular Value Decomposition (SVD) to achieve an initial alignment between and . This was followed by deformable registration between the CT volumes in and , generating the non-rigid transformation mapping the high-resolution template to . Finally, the deformation field was applied to transfer the segmentation labels into the new volume space , generating the spatial constraints for subsequent planning. This approach ensures that the segmented critical structures retain the high resolution of the local template, while structural integrity and smoothness are maintained through the deformation-based method.
3.4. Automated Milling Area Planning
3.4.1. Maximum Milling Area Generation
To define a boundary for milling area planning and establish a foundation for quantifying the spatial constraints posed by critical structures, a maximum permissible milling area was first generated based on the structures that must remain intact during surgery . Prior to generating this area, the surgical target was manually annotated, while potential entry points were automatically identified from the skull surface near the surgical target.
For the construction of a feasible surgical channel, each milled voxel
should contribute to the surgical procedures following channel creation. Simplifying this by modeling the surgical tool as a cylinder,
s should meet the condition that an entry point
and a target point
exist such that the cylindrical region connecting
k and
t with the radius
r encompasses
s:
where
l denotes the linear trajectory between
k and
t, and
represents the distance from the spatial point
s to the trajectory
l.
Consequently, the maximum permissible milling area can be heuristically represented as the union of all potential trajectories of the surgical tools that avoid interaction with the critical structures:
Here,
represents whether the surgical tool will damage any critical structure when accessing
t though
s:
where
denotes the empirically minimum safe distance that the surgical tools should maintain from each critical structure. In our implementation, a safe distance of 0.5 mm was enforced for the scala tympani, the scala vestibuli, the malleus, the incus, the stapes, the chorda tympani, the tympanic drum, and the carotis, while 1.5 mm was set for the sinus dura, and 2.5 mm was set for the facial nerve.
Meanwhile,
in Equation (
3) represents the cylindrical spatial area covered by the surgical tool:
Although this method effectively generates the maximum permissible milling area while ensuring the validity of each milled voxel, the volumetric region is controlled by a set of unordered point pairs,
, which is unsuitable for further refinement. Therefore, we categorize
further based on each spatial element
t within the surgical target:
Because the critical structures are continuous, all feasible entry points k for a single target form a continuous distribution on the skull surface. Therefore, the maximum milling area can be expressed as the set of each spatial element in the surgical target paired with all feasible entry points relevant to that target, namely . The continuity of makes it eligible for parameterization.
Moreover, to ensure accuracy in milling area planning, the preoperative CT scans, critical structure segmentations, entry points, and target points were uniformly resampled to a spatial resolution of 0.25 mm prior to maximum milling area generation.
3.4.2. Volumetric Area Parameterization
The previous generation of the maximum milling area organized the irregular volumetric region into target points and corresponding feasible entry points for each target, represented by . However, the irregular skull surface on which is distributed presents challenges for complete parameterization.
To address this issue, we selected a control plane
parallel to the approach direction of the surgical operations (the A-S plane in RAS space). Each connection between
t and
k, where
, was then extended along the vector
to intersect with
at point
g. The generated point
g then serves as a unique substitute for
k, transforming the irregular distribution on the skull surface into a function for area selection
on the plane
:
where
represents the 3D spatial position of a point
on the control plane.
Since is continuously distributed over the skull surface, the corresponding forms a single connected component on the control plane. This property ensures that each point k within a subregion of () meets the spatial constraints imposed by structure .
To regulate the area
for optimization purposes, we developed a control function to transform
into a subset. For convenience,
was initially converted into the polar coordinate form
, centered at its mass center. Subsequently, a control function
was applied to modify
(see
Figure 5):
where
represents the deformation operation. Given the distribution of the critical structures, the shapes of
are predominantly convex. Hence, the deformed area
generally remains a subset of
in most cases. Nevertheless, a subset restriction was applied to the deformation process to ensure adherence to the spatial constraints:
Finally, the proposed function
was parameterized using
N control points
uniformly distributed along
. The value of
at each
was then determined through spline interpolation of the surrounding control points and constrained to the range
. This approach allows for complete parameterization of the milling area:
In this way, the entire volumetric area can be fully governed by the parameter set , consisting of parameters within the range , with an initial value of 1 for optimization. Here, N denotes the number of control points for a single control function, and represents the number of target points considered during optimization. Furthermore, the proposed modeling method, along with initialization using the maximum milling area, transforms the complex spatial constraints imposed by the surrounding critical structures into a numerical constraint on the parameters.
3.4.3. Milling Area Evaluation and Optimization
To achieve the optimal milling plan, we implemented a series of quantitative assessments, evaluating the feasibility of surgical channels at different parameter settings, including the accessibility of the surgical target after channel construction , the potential injury to bone and the surrounding structures , and the complexity of constructing the surgical channel .
Surgery target accessibility : In Mo [
28]’s study, a series of evaluation methods for surgical channel construction was proposed, including an intrinsic measure of the spatial accessibility
:
Here,
represents the difficulty of accessing a specific target
through the constructed surgical channel
[
28]. Therefore, we adopted this as the metric for surgical target accessibility:
Injury : During surgical channel construction, minimizing the damage to both the skull and the surrounding critical structures
is crucial. Therefore, the proportion of damaged volume within each tissue was adopted as a measure to evaluate the injury sustained during bone milling:
where
and
denote the weights assigned to the injury of different critical structures.
Surgical channel constructability : In Mo [
28]’s study, a metric was also proposed for evaluating the global spatial compactness
of the constructed surgical channel:
where
and
represent distinct spatial elements within the surgical channel
, and
is the Euclidean distance between these locations. If an obstacle exists between
and
,
is set to
∞. Generally,
indicates the compactness of the surgical channel. A more compact channel offers greater flexibility for adjusting the milling tool’s orientation during construction, facilitating easier channel creation.
Besides channel compactness, the boundary smoothness also affects the feasibility of accurately constructing the channel. Balasubramanian et al. [
29]’s study introduced the spectral arc length
for smoothness assessment. However, this metric is less suitable for three-dimensional boundaries. Given that the surgical channel is primarily defined by the target points and associated entry points, the contours of feasible entry points on the control plane
determine the smoothness of the entire channel boundary. Thus, this evaluation can be conducted on the two-dimensional control plane:
where
represents the first derivative of the contour of a two-dimensional region.
By combining these two evaluations, the constructability of the planned surgical channel can be determined:
Subsequently, the NSGA-III genetic optimization algorithm [
30,
31] was employed to iteratively refine the milling area, aiming for enhanced accessibility, safety, and constructability. Two solutions were selected from the Pareto frontier using the compromise programming algorithm [
32] and the pseudo-weight algorithm [
33]. Finally, the solution with the maximum
was chosen as the final result.
3.5. Automated Planning Acceleration
Although the proposed volumetric area modeling methods greatly simplify the definition space of the target milling area and make it compatible with optimization algorithms, the need to iterate over entry points
and surgical targets
, along with calculating the distances between the surgical tools and three-dimensional critical structures at a high spatial resolution, results in substantial time consumption. Therefore, this section introduces several methods (see
Figure 6) aimed at accelerating the automated planning procedure.
Symbolic Distance Field (SDF) of critical structures: In generating the maximum milling area, substantial computation was required to determine the distances between the surgical tools and the surrounding critical structures. Due to the high resolution of the critical structures and the dense spatial distribution of the distance queries, neither point-by-point iteration over structures nor k-d tree searches across the tool trajectories proved efficient enough for this task. To address this, three-dimensional signed distance fields were precomputed for each structure. In subsequent calculations, SDF lookup was employed to replace time-intensive nearest distance or distance calculations, and GPU acceleration was utilized to enhance the efficiency.
Early termination strategy of the traversal process: When calculating the valid entry points
for each entry point, the contribution of a new effective entry point
to the surgical channel
gradually decreases throughout the iteration procedure. Moreover, the physical size of the surgical tool is much larger than the spatial resolution of the entry points. Therefore, the contribution of the new traversed entry point
k was calculated:
When consistently falls below , it is assumed that the maximum surgical channel has nearly been established, and the iteration process is terminated. After projecting onto the control plane for , the pixelized effective area was processed using a morphological closing operation using a kernel of the physical size r to ensure continuity and integrity.
Control of the target area’s resolution and effective target extraction: During surgery, only the surface of the target needs to be exposed to allow access to the target, and the flexibility of the surgical target allows surgeons to reach the full target even with partial surface exposure. Meanwhile, the large number of target points increases the number of optimization parameters, slowing the convergence process. Therefore, target points were filtered to retain only effective points on the surface.
First, target points that were not oriented toward the approach direction were excluded. For each target , the normal vector at this point, , was estimated. For any potential entry point , the angle between the normal and the tool trajectory was calculated as . The count of k satisfying was recorded as . A target point t was deemed effective if it could be accessed from multiple entry points: . The resulting effective target points form .
After that, the accessibility of each was further verified under the constraints imposed by . To expedite this process, was resampled with a spatial resolution of , and the iteration terminated once any valid k was identified. This step further refined the target points into .
Given the convexity of the surgical target and the spatial distribution of the critical structures, is fully accessible if all the points on its boundary are reachable. Therefore, the target area was further compressed to its contour line .
Finally, was arranged in clockwise order along the boundary of as a closed curve and downsampled to using r as the spatial interval to ensure a partial overlap between adjacent access trajectories.
By refining the target points, approximately 20 target points remained for subsequent optimization, while the validity of the planned surgical channel could still be ensured. With 8 parameters per control function , only about 160 parameters were required to control the irregular volumetric milling area.
4. Experiments and Results
To comprehensively evaluate the proposed automated milling area planning algorithm, an experiment was conducted on six clinical CT volumes, comparing the results with those of manual planning. The dataset included three left-ear and three right-ear cases. Acoustic neuromas were used as the surgical targets in the planning process, with eight parameters controlling the maximum milling area for each target point. Additionally, the safe distances were set to 1.5 mm for the sinus dura, 2.5 mm for the facial nerve, and 0.5 mm for the other critical structures, based on guidance from an experienced surgeon. The time consumption and evaluation metrics for the surgical channel were recorded.
4.1. Time Consumption
The time consumption was measured on a graphics workstation with an Intel Core i9-13900K CPU, an NVIDIA GeForce RTX 4090 GPU, and Ubuntu 20.04.1 LTS. Multi-objective milling area optimization was implemented using PyMOO [
34], while the parallelizable components of the algorithm were implemented using CuPy [
35] for GPU acceleration. The time consumption for each step of the algorithm is shown in
Figure 7.
On average, 5840.1 ± 279.9 s was required for the entire planning procedure. Of this duration, 416.9 ± 10.7 s was spent on critical structure segmentation, 2958.5 ± 203.1 s on maximum milling area generation, and 2464.7 ± 74.4 s on milling area optimization.
4.2. Evaluation Metrics for Each Step
The quantitative evaluation for each metric before and after milling area optimization is presented in
Figure 8. Additionally, the generated target points for optimization, the planned milling volume, and the distance distribution from the volume surface to the critical structures are visualized in
Figure 9.
In general, target points oriented toward the approach direction of the surgical operations and the boundary used for the milling area were extracted from the labeled surgical target. The distribution of the distance to the surrounding critical structures on the surface of the planned surgical channel before and after optimization shows a substantial reduction in the areas near the critical structures after optimization. Quantitatively, the average number of voxels within a 1 mm distance from the critical structures decreased by 71.3% across all CT volumes.
The quantitative evaluation metrics for the planned surgical channel, including surgical target accessibility
, structure damage
, global spatial compactness
, and boundary smoothness
, were recorded before and after optimization (see
Figure 8). Generally, the planned milling area is better with a higher value for
, a lower value for
, and a higher value for
, while
is better when it is higher under the same milling volume and will increase as the milling volume increases. Following the optimization process,
,
, and
improved by 1.4%, 18.9%, and 3.5%, respectively, while
decreased by 32.3%. This reduction in
is proportional to the volume size. During optimization, the milling area was minimized to reduce structural damage, which consequently lowered
.
4.3. Comparison with Manual Planning
To verify the feasibility of the milling area planned using the proposed method further, we compared the planning results against those from manual planning performed by surgeons. An experienced surgeon was recruited to manually plan the milling area based on CT images. The planning procedure was conducted using 3D Slicer software with the segmentation editor. To ensure a fair comparison, the surgeon did not have access to the automated planning results, while the same critical structure segmentations were provided. Additionally, since the surgeon primarily used a slice-by-slice planning strategy, a Gaussian filter with a radius of 0.25 mm was applied to the planned results to fill any gaps in the volume.
A comparison between manual and automated planning is shown in
Figure 10. Generally, the automated planning effectively avoided high-risk areas within 1 mm of the critical structures. In each case, high-risk areas within 0.5 mm from the critical structures existed for the manual planning.
The minimal distances from the planned milling volume to each critical structure are shown in
Figure 11, averaged across all six datasets. Compared to the manual planning results, the minimal distances were increased by 0.94 ± 1.24 mm, 0.80 ± 0.43 mm, 0.90 ± 0.41 mm, 0.52 ± 0.48 mm, 0.79 ± 0.64 mm, and 0.51 ± 0.25 mm for the scala tympani, the sinus dura, the carotis, the malleus, the incus, the stapes, the facial nerve, the chorda tympani, and the outer ear canal, respectively. Average increases of 72.0%, 86.3%, 112.1%, 56.5%, 83.7%, and 50.6% were achieved.
Quantitative evaluations of both the manual and automated planning results were also compared. Unlike the previous comparison of the results before and after multi-objective optimization, the injury evaluation
was divided into two categories—damage to the temporal bone and damage to the scala vestibuli—since damage to the scala vestibuli is a critical metric in clinical cases. The evaluation results are presented in
Figure 12. Comparatively, damage to the scala vestibuli was reduced by 29.8%, while the milling volume increased by 29.3% with the automated algorithm. Additionally, the channel boundary smoothness was improved by 78.3%, and the global spatial compactness was improved by 26.4%. The corresponding constructability of the milling area was improved by more than 2.95 times in all cases. The performance in terms of target accessibility was similar for manual and automated planning.
To further compare manual and automated planning, an experienced surgeon was recruited to rate both planning results. This surgeon was different from the one who performed the manual planning. To ensure fairness, the surgeon was not informed about whether each plan was manual or automated. All 12 planned results were randomized in sequence for evaluation, and the critical structure segmentations were provided for reference. During evaluation, the surgeon could freely view each result on both 2D slices and 3D volumes using 3D Slicer. After examining each result, the surgeon completed a questionnaire with three questions focusing on injury risk, constructability, and accessibility:
Q1: Is the planned surgical channel likely to damage vascular, nerve, or any other risk structures during bone milling?
Q2: Is it possible to successfully construct the planned surgical channel using common neurosurgical procedures such as milling?
Q3: After the removal of the corresponding bone tissue, is it possible to successfully achieve the surgical target?
Each question was rated on a scale from 1 to 5, with 1 being the lowest and 5 the highest score. The results of this evaluation are presented in
Figure 13. On average, the manual planning results scored 4.2, 5, and 4.8 for the three questions, while the automated results scored 4.7, 4.3, and 4.3. The surgeon noted that manual planning posed a higher risk to the surrounding structures, potentially due to less comprehensive consideration of the spatial relationship between the milling area and the adjacent structures. Additionally, the surgeon reported that the automated planning results were clinically feasible.
5. Discussion
In this paper, we propose an automated planning method utilizing an optimization method to obtain a safe and executable surgical plan for the milling areas in mastoidectomy. To accomplish this, a template-based approach was employed to segment high-resolution critical structures surrounding the target operation area, using two templates for segmentation via a global-to-local registration process. Following segmentation, the maximum milling area was defined by simulating the access to the target during the surgical procedure. Re-categorization was then applied to the elements within the maximum milling area, with potential entry points projected onto a control plane to simplify the complex expressions caused by the irregular structural surfaces. Next, a set of control functions was introduced to deform the maximum milling area into a subset in a parameterized manner. Finally, the milling area’s constructability, the post-milling accessibility of the surgical target, and the injury risk to the patient were quantitatively evaluated. Multi-objective optimization was then employed to derive an optimal milling plan that met both the safety and feasibility requirements.
In general, the proposed method represents the irregular volumetric area as a combination of multiple cylinders aligned with the physical dimensions of the surgical tools, extending from the entry points to the target. The access trajectories are organized by each target point and are deformed on the entry side, ensuring the effectiveness of each spatial element, even within the deformed surgical channel. Additionally, the definition of the control functions with control points and interpolation preserves the volume’s smoothness. By adopting the maximum milling area as a deformation reference, spatial constraints are transformed into numerical limitations on the control parameters, making the irregular volumetric area suitable for optimization algorithms. Compared to previous works aiming at volumetric milling area planning, Popovic et al. [
21] neglected the thickness of the skull for heuristic generation, while McBrayer et al. [
17], Aghdasi et al. [
22], and Rajesh et al. [
2] pre-planned the milling area on an atlas and mapped it to new CT data for patient-specific milling areas. To our knowledge, our work is the first to achieve successful adoption of optimization methods in this procedure for more feasible milling area planning.
As a result, improved using the computation acceleration strategies adopted in this study, the proposed method could fully and automatically plan the milling area within 5840.1 ± 685.7 s. Due to the long time interval between CT scanning and the conduction of surgery, this time consumption already satisfies the clinical requirements. Moreover, it is still possible to improve the algorithm towards its real-time application. The time consumption of the current algorithm mainly lies in the generation of the maximum milling area and the multi-objective optimization. In both procedures, the spatial relationships between different targets need to be repetitively calculated, leading to a considerable amount of parallel computation, and might be improved by adopting a more powerful GPU. Moreover, the current implementation was based on Python, where a large overhead exists. Implementing the same method using C++ and CUDA, with careful organization to reduce repetitive calculations and the overhead, could significantly reduce the time consumption.
Compared to the manual planning results, our proposed method reduced the damage to the scala vestibuli by and increased the distance from the milling area to the surrounding critical structures by percentages ranging from to , significantly enhancing the safety of the milling procedure. Additionally, the boundary smoothness was improved by and global spatial compactness by when automated planning was used, making the planned milling area easier to construct during surgery. As the surgeon primarily adopted a slice-by-slice planning strategy, comprehensive consideration of the spatial relationship between different slices was lacking, leading to the reduced constructability of the planned volume. In contrast, our method generates a volume fully controlled by the entry and target points, theoretically ensuring that each spatial element is accessible during milling operations. In addition to the quantitative evaluation, a subjective assessment by an experienced surgeon was included. Although the automated plan requires more bone removal, the surgeon observed that the planned volumes maintained a safer distance from the surrounding structures and were feasible for real surgical cases. In most acoustic neuroma cases, the critical structures surrounding the lesion area remain unaffected; therefore, the generalizability of the proposed method will not be influenced by the level of the tumor lesion. However, evaluation on a more diverse dataset is still needed in future work.
Although this paper primarily focuses on mastoidectomy, the volumetric area modeling method is not limited to this clinical case. This approach—using entry and target points for volume representation and employing a maximum area and parameterization strategy for deformation—could also be applied in other situations where complex shapes and constraints need to be quantitatively represented.
Meanwhile, there are several limitations regarding the proposed method that need to be addressed in future studies. During the implementation of the proposed method, the surgical tool was modelized using a cylinder to simplify the calculation. Certain limitation might be raised due to this setup. For most neurosurgical tools, the three-dimensional spatial constraints can be simplified using a cylinder bounding box. However, a volumetric representation of the surgical tool might be needed when a more accurate description of the tool’s shape is needed, which would potentially increase the computational consumption of the proposed method. A way to consider the spatial constraints in such cases remains an open question. Moreover, the proposed method is also dependent on high-quality preoperative CT data and segmentation labels. How to provide a precise distribution of the critical structures when the imaging quality is limited also needs to be discussed. A way to integrate automatic planning into the clinical procedure is also a critical problem. Surgeons need to be aware of the possible risks of the milling plan and understand the evaluation metrics during automatic planning, and a mechanism to integrate the surgeons’ clinical experience during planning will be needed to ensure the safety of the surgery.
6. Conclusions
This paper introduced a modeling method for irregular volumetric areas, enabling automated planning of the milling area in acoustic neuroma surgery. By pre-generating the maximum milling area based on the spatial constraints imposed by the surrounding critical structures, the milling area was fully parameterized using several control functions and a deformation process. Additionally, the complex spatial constraints were translated into numerical limitations on the parameters. The milling area was evaluated using multiple metrics, including accessibility, constructability, and injury, while multi-objective optimization was employed to achieve the optimal planning results. Overall, the proposed planning procedure was able to generate a milling plan within a reasonable time, while the risks during bone milling were significantly reduced. The automatically planned milling areas also demonstrated smoother boundaries, enhancing their feasibility for real clinical applications. This modeling approach is applicable not only to acoustic neuroma surgery but also to other cases where complex relationships between a volumetric target and the surrounding structures need to be considered.