1. Introduction
The cornea has over 16,000 nerve terminations/mm
3 and is the most densely innervated tissue in the human body [
1]. Corneal nerves originate from the nasociliary branch of the ophthalmic division of the trigeminal nerve and divide into smaller branches that predominately innervate the anterior and middle of the stroma. Thereafter, they penetrate Bowman’s layer perpendicularly, then turn perpendicularly once again before branching into one or more subbasal nerves, forming the subbasal nerve plexus which runs between the Bowman’s layer and the basal corneal epithelium, parallel to the ocular surface. Corneal nerves consist of myelinated Aδ fibres of larger diameter and unmyelinated, beaded C fibres of smaller diameter, which provide important sensory functions and play an important role in maintaining the functional integrity of the ocular surface by releasing trophic mediators that promote corneal homeostasis [
2,
3].
Refractive errors are the leading cause of reversible visual impairment worldwide [
4]. Laser refractive surgeries are the most common surgical procedures performed to correct refractive errors and achieve spectacle independence. Established as safe and effective procedures associated with excellent visual outcomes, improvements in quality of life, and high patient satisfaction [
5], it is of no surprise that they are now one of the most commonly performed ophthalmic surgeries globally [
6]. Laser in-situ keratomileusis (LASIK) and small incision lenticule extraction (SMILE) are two common laser refractive surgery techniques. In LASIK and SMILE, the flap or cap creation, stromal ablation/lenticule creation/extraction inevitably transect corneal nerves, leading to corneal denervation. This results in a decrease in corneal sensitivity and dry eye secondary to the neurotrophic cornea. Differences in surgical techniques between SMILE and LASIK are thought to have different repercussions on corneal nerve denervation and subsequent corneal nerve regeneration. The corneal nerve perforation sites, where the stromal nerve fibres perforate the Bowman’s layer and become the subbasal nerve plexus, are predominantly found in the mid-peripheral corneas [
7]. Hence theoretically, in SMILE, with an incision of only 2.1–4 mm, as opposed to the circumferential flap (22 mm) in LASIK, the subbasal and stromal nerve fibres can be better preserved.
In-vivo confocal microscopy (IVCM) has been used extensively to study the cornea at a cellular level, and this includes the corneal subbasal nerve plexus. On IVCM evaluation, patients undergoing refractive surgery may present with a decrease in corneal nerve density, diameter and total length, as well as an increase in corneal tortuosity, branching and beading [
8,
9]. With the growing popularity of corneal refractive surgeries, much research has been devoted to minimising postoperative ocular surface disturbance via better preservation of corneal nerves through the development of new surgical technology and techniques [
10]. Hence, there is a need for a reliable and reproducible quantitative tool to evaluate the extent of corneal denervation and subsequent nerve regeneration following refractive surgery. Assurance of the objective reproducibility of nerve metric evaluation would allow for reliable translation or correlation of corneal nerve analysis by IVCM images to subjective and objective clinical outcomes, such as dry eye assessments.
Currently, analytical software for corneal nerves range from being fully manual (CCMetrics; University of Manchester, Manchester, UK), semi-automated (NeuronJ; plug into ImageJ, NIH, Bethesda, Maryland, USA), to being fully automated (ACCMetrics; University of Manchester, Manchester, UK), all of which produce varying degrees of nerve quantification [
11]. CCMetrics contains an interactive graphical interface which allows users to manually identify and trace nerve fibres, as a built-in algorithm accumulates and tabulates a number of nerve fibres and branches, as well as nerve fibre length and tortuosity [
12]. Neuron J involves the manually-aided tracing of fluorescently labelled neurons using an algorithm which compares pixel intensity of neurons with neighbouring pixels, thus guiding manual tracing along an approximated length of neuron, in addition to tabulating quantities [
13]. ACCMetrics uses a fully automated algorithm that enhances IVCM images, then distinguishes nerve fibres from the background using contrasting neighbouring pixels and quantifies them [
14]. CCMetrics was previously used in several research papers to evaluate corneal diabetic neuropathy [
15,
16]. However, the laborious and time-consuming nature of the technique coupled with the need for trained expertise inspired the development of ACCMetrics, which utilises an algorithm to identify and quantify nerve fibres automatically and quickly. The convenient nature of ACCMetrics quickly led to its increasing popularity. Therefore, a comparison of different nerve analysis systems to determine the correlation and agreement of results between the software programmes would be beneficial in ensuring consistency in nerve quantification across different research work. This will allow for a fair investigation and comparison of nerve damage and regeneration following various surgical techniques, allowing optimisation of surgical techniques which can minimise nerve damage and resultant ocular surface disturbance in high-risk patients.
In the present study, we aimed to validate the use of automated and manual quantitative analysis of the corneal nerve plexus for patients who had undergone refractive surgery. We first evaluated the repeatability and reproducibility of manual quantitative analysis and then assessed the agreement and correlation between the automated and manual quantitative analysis.
2. Materials and Methods
The population of this study consisted of 20 patients (40 eyes) who were randomised to receive SMILE in one eye and LASIK in the other eye, performed in the Singapore National Eye Centre during the period between May 2012 and November 2016. This study was part of a registered randomised controlled trial (RCT; NCT01216475). Approval for the study was granted by the institutional review board of SingHealth, Singapore (reference number: 2011/109/A), and the study was conducted in accordance with the Declaration of Helsinki. LASIK and SMILE procedures were performed as previously described [
17,
18]. In brief, a superiorly hinged 120 μm thick flap was created using the Visumax femtosecond laser (Carl Zeiss, Jena, Germany) in the LASIK procedure, followed by excimer laser ablation (Wavelight Allegretto WAVE Eye-Q 400 Hz, Wavelight GmbH, Alcon, USA). For the SMILE procedure, the Visumax femtosecond laser with the following parameters was used—3.2 mm incision, 120 μm cap thickness, 7.5 mm cap diameter, 6.5 mm optical zone, and 145 nJ power with side-cut angles at 90 degrees. After careful dissection of the anterior and posterior planes of lenticule, the lenticule was extracted through the small incision.
The IVCM examination was performed at an average time of 4.3 ± 1.7 years after surgery (range 2.1–6.5 years). IVCM was conducted using a confocal scanning microscope, the Heidelberg HRT3 Rostock Cornea Module (Heidelberg Engineering GmbH, Heidelberg, Germany). Images generated were 2-dimensional, consisting of 384 × 384 pixels covering a field of 400 × 400 um. The cornea of the patient being examined was instilled with a drop of topical anaesthetic 0.4% benoxinate hydrochloride (Oxybuprocaine hydrochloride, Minims; Bausch and Lomb). The patient was then instructed to fixate on the flashing light of the instrument. The objective tip was advanced forward until gentle contact was established between the gel and the cornea. Both corneas of each patient were examined in five different areas: The central cornea was scanned first, and then the patient was asked to change their gaze to scan the superior, inferior, nasal and temporal part of the cornea (approximately 3 mm away from the corneal apex for each). The patients were instructed to fixate on a light source from a different direction with the contralateral eye to stabilise the scanning view.
For each scanned area, five best focused and most representative images from different depths of the subbasal nerves were selected. Each nerve (main trunk or branched nerve) was selected only once (i.e., the same nerve fibres were not repeatedly selected) to get a better representation of the subbasal nerve plexus. The 25 selected IVCM micrographs for each eye were evaluated using both manual and automated image analysis software (CCMetrics and ACCMetrics, respectively; University of Manchester, Manchester, UK). All the images were evaluated by two trained, independent, masked observers (JYC, LWYY) with CCMetrics for three nerve parameters: Corneal nerve fibre density (CNFD; number of main nerve fibres/mm
2); corneal nerve fibre length (CNFL; total length of all nerve fibres in mm/mm
2); and corneal nerve branch density (CNBD; number of branch points on the main fibres/mm
2), and inter-observer reproducibility was evaluated. In CCMetrics measurements,
all visible nerves were traced with a manual drawing module, by manually marking the nerve fibres with red lines, nerve branches with blue lines, and by putting green dots to the branching points, defined as the points intersecting the fibres and branches. Two weeks after the completion of the first evaluation, both observers repeated the analysis for all the images, and intra-observer repeatability was assessed [
19]. All the images were also analysed with ACCMetrics, and the agreement and correlation between the ACCMetrics and CCMetrics measurements was evaluated.
The data were analysed using STATA (STATACorp, TX, USA) and NCSS (LLC, MI, USA), and was presented as mean ± standard deviation. Bland-Altman plots were employed to determine intra- and inter-observer agreements between the measurements. The repeatability and reproducibility values were calculated in terms of mean bias and 95% limits of agreement (LoA). The intraclass correlation coefficient (ICC) was determined as an index of repeatability and reproducibility between measurements. Paired t-tests were used to assess the differences between intra-observer and inter-observer measurements. A Passing-Bablok regression was used to evaluate the relationship between the ACCMetrics and CCMetrics measurements. p < 0.05 was considered statistically significant.
4. Discussion
In the present study, we demonstrated that both automated and manual nerve analytic software were reproducible and had good agreement in the evaluation of corneal nerves status following refractive surgery, particularly in the assessment of CNFL and CNFD. However, we found that automated ACCMetrics attained lower values compared to manual CCMetrics, especially when the fibre density or fibre length was higher. CNBD, compared to other parameters, had lower but still acceptable intra-observer and inter-observer reproducibility.
Existing nerve analysis software can be broadly categorised into manual, semi-automatic and fully automatic, each with varying disadvantages and advantages [
11]. The three most commonly utilised nerve analysis software programmes in each category are CCMetrics for manual quantification, NeuronJ for semi-automated quantification and ACCMetrics for fully automated quantification. ACCMetrics utilises a fully automated algorithm which identifies and quantifies nerve fibres in eight different parameters, compared to the four parameters quantified by CCMetrics (CNFD, CNFL, CNBD and tortuosity coefficient). The four additional parameters, nerve fibre area, nerve fibre width, nerve fibre orientation histogram and nerve fibre width histogram, are advantageous in that they provide additional dimensions and allow for more detailed comparisons in nerve analysis. Automated quantification consists of two steps. Firstly, images are enhanced, and nerve fibres are detected by a dual model feature descriptor combined with a neural network classifier which distinguishes nerve fibres from background noise and underlying connective tissue [
20]. Next, the software quantifies the morphometric nerve parameters. Each image takes an average of approximately 15 s for the complete analysis, much shorter than the 10–20 min taken for manually analysis depending on the complexity of the image [
21]. The automated software is, thus, efficient in that it saves the analyst time, removes the need for an expert and exhibits objectivity in nerve identification, especially if nerve fibres are of good calibre and demonstrate good contrast from the background [
22]. This would also eliminate the need to account for inter-observer and intra-observer variability and reproducibility in ascertaining the validity and reliability of the results.
However, IVCM corneal nerve images contain a wide range of nerve fibres of differing contrast from the background. Thicker nerve fibres tend to show stronger contrast from the background and are more easily identified by the automated algorithm. On the other hand, nerves that are not in focus, as well as thinner fibres that tend to appear fainter, may be missed out due to their poor contrast in especially in images with noisy backgrounds [
22]. These false-negative errors may result in an underestimation of the nerve parameters. In contrast, manual software CCMetrics utilises an algorithm to tabulate nerve statistics following manual identification and tracing of nerve fibres on an interactive graphical interface of the nerve images. Nerve identification and tracing depends on the user’s trained discretion rather than automated algorithms. These differences in software explain the underestimation of nerve parameters by ACCMetrics seen in our study, in which we observed a greater percentage of underestimation error as CNFL values increased. ACCMetrics may also be associated with several overestimation scenarios occasionally, resulting from mistaken recognition of some non-nerve related structures, such as light reflections or dendritic cells that display sufficient contrast to fulfil the criteria of the algorithm, although these occurred less frequently in our experience.
However, manual marking is time-consuming and labour-intensive with an analysis time of approximately 20 min per image. Given its laborious nature, manual tracing is subject to user fatigue, increasing the possibility of unreliable tracing, and lowering productivity due to the limitation in the number of images analysed per day [
23]. In addition, it is highly user-dependent and requires the expertise of the observer in both identifying and tracing the tortuous nerve contours. Thus, the subjective nature of nerve identification is more obvious in manual quantification, resulting in inconsistencies and a need for both inter-observer and intra-observer analysis [
23]. This is illustrated by the worse inter-observer agreement compared to the intra-observer agreement. In our study, we also showed that the inter-observer agreement between the second measurements of all the parameters measured by CCMetrics was better than that between the first measurements. This is not surprising given that manual nerve tracing accuracy improves with user experience, as repetition and exposure allows for familiarity and refinement of nerve identification and tracing skills.
Several previous published studies have compared manual and fully automated methods of nerve quantification, but all of them have been limited to patients with diabetes mellitus (DM). The pathogenesis of DM corneal neuropathy is different from that of refractive surgery. DM causes a slow degenerative neuropathy [
24], while corneal denervation following refractive surgery involves acute nerve denervation. Furthermore, DM patients show minimal nerve regeneration if any, even after systemic stabilisation. In contrast, post-refractive surgery patients demonstrate nerve regeneration, which may take several years [
25]; hence, it is imperative that there exists a reliable analytic tool to quantify these changes. Since there is an inherent underlying difference in the mechanism of nerve damage and repair between refractive surgery and DM corneal neuropathy, it is important to study the validity between the automated and manual software to ensure consistency in nerve quantification in refractive surgery.
Dehghani et al. reported that automated methods produced underestimated CNFL results compared to manual methods in the patients with diabetic corneal neuropathy, but the results obtained from both methods were in good agreement and correlation, with an ICC value of 0.86 and Pearson correlation coefficient of 0.87 [
22]. In our study, we showed good agreement in CNFL measurements obtained by CCMetrics and ACCMetrics, with an ICC of 0.789, which was lower than that reported by Dehghani et al. A possible reason for this might be the difference in the mechanism of repair of corneal subbasal nerves, which primarily results from the nerve regeneration capability following refractive surgery that does not occur in progressive DM. Thin and short regenerating nerves following refractive surgery have a higher tendency to be missed by ACCMetrics in our experience. In another study on DM corneal neuropathy, Ostrovsky et al. showed good correlation and agreement for CNFL between manual and automated software, with a Pearson correlation coefficient of 0.82 [
21]. However, the authors pointed out that there was an absolute measurement bias, corresponding to a 21% underestimation of absolute CNFL values using ACCMetrics. In our paper, ACCMetrics had an average underestimation of 3.582 mm/mm
2 in CNFL, corresponding to a 32% underestimation of absolute CNFL values. The higher underestimation in the present study might have arisen from the fact that our study population had higher CNFL (11.3 ± 5.8 mm/mm
2;
Table 1) than the DM population (9.58 ± 2.19 mm/mm
2) [
21], and the effect of underestimation is greater with the increase in CNFL values (
Figure 3A).
CNFD measurements demonstrated good agreement between ACCMetrics and CCMetrics with an ICC value of 0.811. This supports the premise that ACCMetrics is capable of identifying well contrasted, thicker nerves, of which long main nerve fibres typically possess at least one characteristic along a significant portion of its length, and are thus, correctly identified as main nerves. CNFD measurements also demonstrated good intra-observer and inter-observer agreement using CCMetrics, with average ICC values of 0.757 and 0.728, respectively. There was little difference between manually obtained CNFD values as main nerves are typically long and easily identifiable manually. Small differences between observers arose mainly in images with many conjoined long nerves, in which there was a difference in opinion regarding whether the nerves were main nerves or long branching nerves.
Among the three parameters, CNBD showed the worst agreement between ACCMetrics and CCMetrics, as well as the worst intra-observer and inter-observer agreement in CCMetrics measurements with ICC values of 0.642, 0.653, and 0.591, respectively. This is consistent with a previous DM corneal neuropathy study, in which the authors demonstrated that intra-observer and inter-observer repeatability and consistency were better for CNFD and CNFL, compared to CNBD (inter-observer ICC 0.54; intra-observer 0.61) [
26]. We postulate that the poor contrast of nerves from background contributed to the discrepancy in identifying branch points. ACCMetrics is sometimes unable to identify the full length of main nerves where they become thinner or fainter, causing it to miss branch points extending from that portion of the main nerve. In addition, branches which demonstrated poor contrast either in their full length or near their branch points were also not identified by ACCMetrics (
Figure 1C). The poor agreement between observers and software can also be attributed to the variability regarding disputable nerve structures in which the distinction between clearly defined nerve parameters was unclear. Common problems involved two crossing fibres, which could be interpreted as a single branching fibre by one user and as two fibres without branches by the other user. These reasons could explain the propensity for errors in the CNBD measurements.
The inherent limitation of current IVCM scans is that it provides a small region of interest (<500 μm
2) and is not easy to standardise scanning location and depth each time. The distribution of corneal nerve plexus also varies in the central and peripheral cornea [
27]. Hence, it is important to standardise the scanning and analysing protocol when utilising IVCM for research. Automated mapping or montaging techniques for images [
28], as well as wide-field or large-area scanning of corneal subbasal nerve plexus [
29], may be helpful in the future.