1. Introduction
The Cephalometric is a set of tracing planes generated by some combination of Cephalometric points in lateral cranial radiography; the measurement of the length between Cephalometric points and the angle formed by Cephalometric planes gives a descriptive and predictive tool of patient craniofacial development [
1]. The Cephalometric is a principal diagnostic method because it allows us to study skeletal, dental, and soft tissue aspects, dental-maxillary anomalies, dental misalignment, and malocclusion of dentoskeletal origin [
2]. Furthermore, it evaluates orthodontic treatment progress and surgical outcomes of dentofacial deformity treatment [
3].
Typically, Cephalometric analysis has been performed manually by tracing landmarks on patient radiography and measuring linear and angular variables. The technique is time-consuming and has several drawbacks by the dentists, including errors during hand tracing, landmark identification, and measurement [
4,
5,
6]. The software is not only used to assist measurement; it could be applied to aid the carrying out and planning of orthodontic treatment or orthognathic surgery, such as TFA [
7].
Around the world, many universities offer dental clinical services to local communities where undergraduate students practice. Typically, the patients have to attend one day to take the X-ray radiography and return another day to hear the diagnostics; this could be made more efficient if the dentists would reduce the time expended in measuring and analyzing the Cephalometry. The software application has demonstrated very usefully to minimize the time inverted.
There are several commercial alternatives to perform the technique; among them, we can mention Quick Ceph, Dolphin, Nemoceph, Vistadent, TraceCeph, CephX, and Facad. Using the software is very helpful for orthodontic practitioners in performing Cephalometric analysis and determining diagnostic and treatment plans. However, the cost of the purchase of this software is quite expensive [
8].
Some health institutions have the income to obtain a limit of commercial software licenses, and not all undergraduate dentists have access to computers with one of these licenses. In developing countries with low and medium-income would be challenging to buy a license.
Medical services are inaccessible for almost 100 million people who live in extreme poverty [
9]. Many governments and civil association campaign programs exist to carry these services to marginal communities. The accuracy and time expended are crucial factors in these campaigns.
The engineering faculty continually finds ways to give back to society with our science and technology knowledge. Software measuring the Cephalometric angles and distances based on vector algebra was designed. The software is offered as freeware and allows other programmers to use it as open source.
Cephalopoint is an open source software code programmed on Octave; each code applied vector and matrix operations to measure the Cephalometric traces according to Downs and Steiner’s Analysis. Octave was selected as the platform to develop the software because it has a set of commands to convert images, do matrix and vector operations, and plot tools in many ways. Furthermore, it offers the possibility to incorporate routines from other programmers to upgrade the code helping to do image processing to recognize the Cephalometric points or use special statistical libraries.
Measurements were made comparing the precision, repeatability, and time invested by the manual technique and software provided. Cephalopoint application showed accuracy in the results very close to manually; the time expended was a third part of the manual technique.
The measurements obtained by software technique were analyzed statistical variables. The values analyzed showed normal distribution and correct behavior.
2. Materials and Methods
2.1. Cephalometric Methods
The Lateral Cephalometric analysis is based on the correct position of the landmarks, as seen in
Figure 1.
The principal difference between Downs’ and Steiner’s Cephalometric is the use of different Cephalometric planes and angles.
Downs Cephalometric Analysis is the measurement to identify skeletal malformations by relating the various Cephalometric points and planes [
10,
11]. Tracking the planes and angles formed by Cephalometric points is necessary to determine the skeletal malformations, as shown in
Table 1. Downs’ analysis is typically used in children and adolescents who are still growing.
Steiner Cephalometric Analysis is the measurement of craniofacial structures based on the analysis by tracing and locating lateral facial landmarks; Frankfort plane (PoOr) is not used in this analysis [
1,
11]. The anatomic points formed the planes and angles to obtain the clinical data, as shown in
Table 2. Steiner’s analysis is typically used in adults whose growth is already finished.
2.2. Software Designed
The software was designed as an Octave’s Library, formed by four files: Two file codes to create the headers of Downs’ and Steiner’s landmarks of reservoir corresponding files. One file code to do Downs’ analysis and another Steiner’s analysis. The library is available to download from the Cephalopoint website [
12].
A graphical interface was necessary created to make the software friendlier. The version of Octave used was 6.2.0; it does not have an interface to create applications or bottoms like Matlab. Still, it is possible to emulate a graphic interface using a modal dialog box containing one or more text edit fields and returns the values entered by the user.
The logic used to program Cephalopoint is described by the following flowchart diagram, as shown in
Figure 2. Initially, select one of the Cephalometric techniques routine. The software opens a dialog box where the name of the JPG Cephalometric file, an output file to save the results, and the size of the scale bar would be written. Then the code opens the image selected and will ask for the landmark points. The code will calculate the size of the vectors and the angles between planes, applying the concepts shown in
Appendix A; finally, the measures will be saved at the output file previously selected.
The measurements would be saved to a comma-separated value (CSV) output file. It is helpful for dentists to perform statistical, clinical analysis, as it could be used in any spreadsheet.
2.3. Accuracy and Time Tests
Sample: Three different cases for each method were studied 32 times using manual and software techniques.
Manual Technique Materials: The research materials used were digital Cephalograms in JPEG format and printed. Research tools used were tracing paper, negatoscope, 4H pencil, adhesive (taped), protractor (Baum), rule, erasers, Hardware: Laptop HP ProBook Core i5 2.30 GHz, Memory RAM 4 GB, Intel HD Graphics Family; software Windows 7 and Octave. The object of this study was the digital X-rays obtained from Orthopantomograph OP300 of the patients. Downs and Steiner lateral Cephalometric analysis using conventional techniques was performed by tracing landmarks on tracing paper of the printed Cephalogram. On each sample Cephalogram, the determination of Downs’ and Steiner’s reference points (as seen in
Figure 1), lines and planes drawing, angle and distance measurement using protractor and rule were conducted.
Software Technique: The same cases and times were analyzed using Cephalopoint and saved in the respective output file.
Time Measure: The Manual and Software techniques were chronometric each time and compared the average obtained in each method.
Accuracy Measure: The mean (
m) and standard error (
) of each Downs’ and Steiner’s variables were calculated and compared for each case.
where
n is the number of samples and
S is the standard deviation.
Bland–Altman Analysis: It was proposed as an alternative analysis based on quantifying the agreement between two quantitative measurements by studying the mean difference and constructing limits of agreement. The Bland–Altman plot analysis is a simple way to evaluate a bias between the mean differences and estimate an agreement interval, within which 95% of the differences of the second method, compared to the first one, fall [
13]. The graph displays a scatter diagram of the differences plotted against the averages of the two measurements. Horizontal lines are drawn at the mean difference and the limits of agreement, as seen in
Figure 3a. The regression line of the difference could help in detecting a proportional difference [
14]. The plot allows to evaluate the global agreement between two measurements.
Normal distribution of the differences must always be verified by drawing a histogram, as seen in
Figure 3b. The statistical test should always be used to determine if the distribution is normal since, in some cases, normality cannot be determined simply by observing the histogram plot. A test for normal distribution (such as the Shapiro–Wilk test) can be done for the hypothesis that the distribution of the observations in the sample is normal (if
p < 0.05, then reject normality). If differences are not normally distributed, a logarithmic transformation of original data can be tried [
13].
Intra and inter-operator reproducibility were compared and analyzed using Bland–Altman analysis.
Repeatability Coefficient: The repeatability coefficient (
) is a precision measure that represents the value below which the absolute difference between two repeated test results may be expected to lie with a probability of 95% [
15].
2.4. Statistical Test
Patient Demographics: The study sample contains 84 patients (32 M, 52 F) treated in the Orthodontics Clinic of the university from Mar 2017 until July 2017. Two formed groups: Steiner’s analysis group selected 42 patients (20 M, 22 F); mean age ; ranging from 20 to 72 years. Downs’ analysis group set 42 patients (18 M, 24 F); mean age ; ranging from 7 to 19 years.
Sample Size Calculation: The sample size calculation had on Downs’ and Steiner’s norms variables and their standard deviation values. The significance level ranged at 5% and a confidence level of 95%. Thus, a sample of 36 patients in each group would give 88% power under these circumstances. In consequence, 42 patients were established as the sample size for both groups.
Kurtosis: (
K) is a statistical measure that is used to describe the distribution. Kurtosis measures extreme values in either tail. Distributions with large kurtosis exhibit tail data exceeding the tails of the normal distribution. Distributions with low kurtosis exhibit tail data that are generally less extreme than the tails of the normal distribution [
16].
The excess kurtosis (
) is defined as kurtosis minus 3. There are three distinct regimes as described below.
There are three categories of kurtosis that a set of data can display. All measures of kurtosis are compared against a standard normal distribution [
17].
Leptokurtic distribution
. Leptokurtic has very long and skinny tails, which means there are more chances of outliers. Positive values of kurtosis indicate that distribution is peaked and possesses thick tails. An extreme positive kurtosis indicates a distribution where more numbers are located in the distribution’s tails instead of around the mean (
Figure 4a).
Mesokurtic distribution
. Mesokurtic is the same as the normal distribution, which means kurtosis is near to 0. In Mesokurtic, distributions are moderate in breadth, and curves are a medium peaked height (
Figure 4b).
Platykurtic distribution
. Platykurtic having a lower tail and stretched around center tails means most data points are in high proximity with mean. A platykurtic distribution is a flatter (less peaked) when compared with the normal distribution (
Figure 4c).
To establish a criterion of decision between which values of
could be considered as a mesokurtic distribution, the
adjusted Fisher-Pearson standardized moment coefficient of skewness (
) was considered.
The lower and upper limits were established as 90% range for sample skewness coefficient
[
18].
The Shapiro–Wilk test (
W) is a way to tell if a random sample comes from a normal distribution. The test gives a
W value; small values indicate the sample is not normally distributed (you can reject the null hypothesis that the population is normally distributed if your values are under a certain threshold) [
19]. The domain of
W is restricted to the interval (0,1).
where,
(with parentheses enclosing the subscript index
i; not to be confused with
) is the ith order statistic;
are constants generated from the covariances, variances, and means of the sample (size
n) from a normally distributed sample.
The coefficients
are given by:
where
C is a vector norm:
Furthermore, the vector
u,
It is made of the expected values of the order statistics of independent and identically distributed random variables sampled from the standard normal distribution; finally,
V is the covariance matrix of those normal order statistics [
20].
Since you use tables the
W will be between two values:
and
, and the
p-value (
p) between the relevant two
p-values:
and
. Calculate the approximate
p-value using a linear ratio.
The null hypothesis of this test is that the population is normally distributed. Thus, if the , then the null hypothesis is rejected and there is evidence that the data tested are not normally distributed. On the other hand, if the , then the null hypothesis (that the data came from a normally distributed population) can not be rejected.
The data was analyzed using an Octave’s code programmed by Nir Krakauer [
21] based on the study of Patrick Royston [
22]. This code was validated comparing the value of
calculated with
observed on the study of Shapiro and Wilk [
23]. Other data tables were analyzed, and the correspondence between these values is excellent.
3. Results
The results of the Cephalometric Manual and Software techniques were analyzed according to the tests proposed. The results are shown according to each test.
3.1. Time Test
Table 3 shows the average time used by each case and method after 32 times.
The average time to realize all manual techniques was 17.4 min per patient compared with the average time to realize all software techniques was 6 min per patient.
3.2. Accuracy Test and Techniques Comparison
3.2.1. Downs’ Cephalometric
Case 1 (
Figure 5a) was classified as skeletal class I, normal chin with suggestive increased lower incisor proclination. Case 2 (
Figure 5b) was classified as skeletal class II, suggesting a prominent maxillary denture base with the mandible and unfavorable face, also indicates vertical growth pattern of the mandible, and suggestive of increased lower incisor proclination. Moreover, finally, case 3 (
Figure 5c) was classified as skeletal class III, suggesting a prominent maxillary denture base with the mandible. These diagnostics were verified applying Downs’ conventional and digital techniques.
Three different cases were used to analyze Downs’ Cephalometric, as shown in
Figure 5. The 32 repetitions of each patient were necessary to estimate the mean of the variable and standard error, as shown in
Table 4.
As seen in
Table 4, Downs’ Cephalometric measurements obtained manually and software technique were very close. The mean absolute error (MAE) was calculated from standard errors to explore the precision of applying Cephalopoint. The MAE estimated for the manual technique was 6.73%, while the software technique was 4.73%.
Bland–Altman plot indicates the average difference between the two readings.
Figure 6 shows the Bland–Altman intra-operator reproducibility to Downs’ method.
The plots show all possible combinations between techniques and cases for Downs’ Cephalometric. As seen in
Figure 6, the distribution is very similar to the example of the normal distribution shown in
Figure 3.
Intra- and inter-operator were calculated for each case to Downs’ analysis, as shown in
Table 5. Our observation clearly showed the low mean correlation (<0.6) for intra- and inter-operators reproducibility between all cases and techniques applications, showing the randomness between the measurements. Shapiro–Wilk test showed normal distribution (W close to 1) for intra- and inter-operators; this confirms the assumption was made previously.
The values of the repeatability coefficients are higher than the limit of 95% confidence interval showed for each combination in the Bland–Altman plot, as seen in
Figure 6.
3.2.2. Steiner’s Cephalometric
Case 1 (
Figure 7a) was classified as Basifacial with the recessive location of the maxilla, mandibular prognathism, negative growth pattern, skeletal open bite case, class III skeletal tendency, and class III molar. Case 2 (
Figure 7b) was classified as Dolichofacial with a negative growth pattern and class II div 1 dental case. Moreover, finally, case 3 (
Figure 7c) is classified as Mesiofacial, class I skeletal tendency, and class I dental molar. These diagnostics were verified by applying Steiner’s conventional and digital techniques.
Three different cases were used to analyze Steiner’s Cephalometric, as shown in
Figure 7. The 32 repetitions of each patient were necessary to estimate the mean of the variable and standard error, as shown in
Table 6.
As seen in
Table 6, Steiner’s Cephalometric measurements obtained manually and software technique were very close. The mean absolute error (MAE) was calculated like Downs’ Cephalometric cases. The MAE estimated for the manual technique was 5.04%, while the software technique was 3.72%.
Bland–Altman analysis was also employed with Steiner’s Cephalometric measurements. The Bland–Altman plots intra-operator reproducibility to Steiner’s method are shown in
Figure 8.
As expected, the plots show all possible combinations between techniques and cases for Steiner’s Cephalometric. As seen in
Figure 8, the normal distribution in all Bland–Altman plots is very clear.
Intra- and inter-operator were calculated for each case to Steiner’s analysis, as shown in
Figure 8. Our observation demonstrated the low mean correlation (>0.6) between intra- and inter-operator reproducibility for all cases and techniques applications, showing the randomness between the measurements. Shapiro–Wilk test showed normal distribution (
W close to 1) for intra- and inter-operators; this confirms the assumption was made previously, as seen in
Table 7.
The values of the repeatability coefficients are higher than the limit of 95% confidence interval showed for each combination in the Bland–Altman plot, as seen in
Figure 8.
3.3. Statistical Test
The mean, standard deviation, kurtosis, p-value, and Shapiro–Wilk test were the statistical variables calculated for each Cephalometric method applying Cephalopoint.
3.3.1. Downs’ Cephalometric
A group of 42 patients (18 M, 24 F); mean age ; ranging from 7 to 19 years were measured using Downs’ Cephalometric and analyzed using statistical variables.
All variables of Downs’ Cephalometric were non statistically significant difference
, in combination with
, as seen in
Table 8, the variables showed normal distribution. After analyzing the excess kurtosis, all the variables exhibited Mesokurtic distribution (
), confirming normally distributed measurements.
3.3.2. Steiner’s Cephalometric
A group of 42 patients (20 M, 22 F); mean age ; ranging from 20 to 72 years were measured using Steiner’s Cephalometric and analyzed using statistical variables.
All variables of Steiner’s Cephalometric were non statistically significant difference
, in combination with
, as seen in
Table 9, the variables showed normal distribution. After analyzing the excess kurtosis, all the variables exhibited Mesokurtic distribution (
), confirming normally distributed measurements.
4. Discussion
Cephalopoint could significantly reduce the time employed by the dentist to analyze the Cephalograms. In this sense, the diagnostic would be almost instantly, and the clinics will be more efficient. On the other hand, if campaign programs in low-income communities where the provisional clinic would be installed, electric energy will be necessary.
The mobility could be improved by applying the same principles to create a mobile application. Thus, these clinical campaign programs would be expanded to the most remote communities.
These codes would be migrated to another programming language like Python, offering the advance of not needing software, like Octave, to run each code. Furthermore, Phyton offers the possibility to improve the graphic interface to be more friendly.
On the other hand, Octave is the programming language more used by the Mathematics Academy of Engineering Faculty members, prioritizing that other researchers, teachers, and students can join the project to improve it. Several routines on Octave and Python were running and compared, and both were very similar in precision and time employed [
24,
25]. Both languages are excellent open source alternatives.
Cephalopoint was designed to offer a free alternative to analyze with Downs’ and Steiner’s Cephalometric techniques. The use of open source software has several advantages. Other users could modify it to their specific needs, either working with Cephalometrics or analyzing further X-ray radiographs. Both the interface and the code are constantly evolving thanks to the internet community. The success of open source software development has increased the number of companies seeking this development model. Many software companies have approached the open source software community to work collaboratively to increase the use of these programs [
26].
Cephalopoint could be improved by adding a routine to compare the measurements with the standard values for each variable and return the difference. Another to plot the Cephalometric points and planes instantly after was declared. The use of filters to clean up the image could improve the selection of Cephalometric points.
The image analysis techniques on the Cephalograms will be applied to recognize patterns and thus detect the different landmarks of each Cephalometric method. A first approximation could be applying an artificial neural network to acknowledge texture characterization as Fibrosis and Carcinoma in lungs [
27], in the same way, we will seek to recognize the Cephalometric landmarks.
The same analysis to calculate the angles and distances between the reference points could be used in other fields of medicine, where the study of radiographs is involved. For example, to study bone thickness in magnetic resonance imaging of bodies postoperative vertebrae L4 and L5, using the proposed technique to evaluate the treatment of pyogenic spondylodiscitis [
28]. To measure the angle of pelvic tilt in sagittal radiograph to determine the evolution of scoliosis deterioration [
29]. Furthermore, to offer an alternative software to measure the orbital volume after Enucleation with Orbital Implant [
30].
Many studies compared the manual with digital lateral Cephalometric using license software. Bastos Paixão et al. employed Dolphin and analyzed with t-Student distribution [
31], Thurzo et al. employed Dolphin and analyzed with Bland–Altman analysis. Esteva Segura et al. [
32] employed Nemoceph Nx and analyzed with t-student distribution [
33], Gartati et al. employed CephNinja and analyzed with t-Student distribution [
8], Mohan et al. employed OneCeph and analyzed with t-students distribution [
34], among other authors. All the studies confirmed no significant differences and normal distribution for all Cephalometric variables measured by software. Furthermore, the error standard of each variable was reduced, exhibiting the argument of Alburquerque-Junior and Almeida [
35] and Chen et al. [
36]: “the computerized method is reliable as it exhibits lower error variance than the conventional method”. In the present document, both issues were confirmed by applying open source software of own design.
5. Conclusions
Cephalopoint could be offered some advances and problems compared with other commercial software. However, this software is not trying to compete but offers a low-income alternative with good accuracy and repeatability.
The average time employed to do Cephalometric analysis with the software was reduced to the third part concerning manual technique. The measurements were calculated almost in an average time of 6 min. Thus, the software could help the dentist reduce the time required to perform a diagnostic.
On the other hand, using the file where all the measurements are saved reduces the time to carry out statistical studies and organize patient information. Furthermore, the CSV format creates universal files that could be worked in any spreadsheet or statistical software.
Another advantage of the software is the appreciable decrease in error measurement considering the manual technique in Down’s method from 6.73% to 4.73% and Steiner’s method from 5.04% to 3.72%, which was to be expected as commented in the previous section.
Table 5 and
Table 7 show the comparison with the manual and software technique using the Bland–Altman analysis. It exhibited that for both the Downs and Steiner methods, the correlation is very low, so it can be considered that the measurements in the different cases are entirely independent of each other. The Down’s method (
) and Steiner’s method (
) were calculated using the Shapiro–Wilk test; in both, the behavior of all different combinations of differences between measurements and cases, in other words, the infra- and intra-operators have a normal distribution. This assumption was confirmed in Bland–Altman plots, as seen in
Figure 6 to Downs’ technique and
Figure 8 to Steiner’s technique.
The repeatability of each case analyzed by Down’s and Steiner’s methods applying Cephalopoint was very high, as seen in
Table 5 and
Table 7.
About statistical analysis, all the variables analyzed Down’s and Steiner’s methods applying Cephalopoint showed normal distribution, as seen in
Table 8 and
Table 9. Consequently, the measurements obtained did not present statistical biases, so Cephalopoint is a good instrument for clinical statistical analysis.
The free software designed, Cephalopoint, is a good instrument to apply Down’s and Steiner’s Cephalometric methods, exhibiting enough accuracy, precision, and repeatability, with the possibility to be employed for clinical studies. It adds one more possibility to the list of software that can perform this type of measurement.