As already mentioned, one of the key features motivating our choice of using the CGP is the possibility of obtaining some insights into the rationale behind the classification process. Indeed, by decoding the genotype obtained at the end of the training phase, one can obtain the explicit classification model inferred by the CGP. This model, in turn, allows one to achieve two goals: to highlight the most informative features, and by interpreting the identified relationships, to obtain more efficient guidelines for the diagnosis of the disease.
Looking at the explicit models obtained at the end of 19 out of 20 cross validation runs, a general model can be extracted. This model (reported in Algorithm 1) takes into account the relationship among three features (
,
, and
) and a combination of other features, indicated with
. This suggests that, regardless of the data used for training the CGP, an underlying scheme is always present in the obtained models, which involves three main global features:
and
, related to the global difference between handwritten and template trace, and
, related to the maximum extension of the handwritten trace. One model out of twenty was more complex and was not characterized by the same underlying scheme. This model was the one with the worst performance.
Algorithm 1 The general model inferred by using all the best models at the end of each run. can contain both local and global features. |
if then |
output = ”control”; |
else |
output = ”patient”; |
end if |
Table 6 reports the occurrence of the features in the models evolved by the CGP across all the runs. The data shows that, among the features included in the expression
,
is always present, while the other ones are exploited for further refinements of the classification criteria, and presumably, are less informative then the others in highlighting the graphical signs caused by the disease.
In order to evaluate whether the features which occurred least (
and
) were those carrying less information about the handwriting signs of the disease, we excluded them from the training phase of the CGP. As expected, since those features rarely contribute to the definition of the evolved models, information carried by them does not capture distinctive signs of the disease, and in some cases, could represent a noise component for the classification process. Indeed, looking at the results reported in
Table 7, it can be observed that, on average, classification performance is not significantly different in the three conditions and the standard deviation of the results decreases by removing the features.
Furthermore, looking at the relationship among features included in the best performing model (see
Figure 4), some guidelines for the diagnosis of the disease can be extracted. First of all, the model analyzes the relationship between six features, dropping three of them, namely
,
, and
. This suggests that an overly fine characterization of the trace drawn by the subjects, such as the one measured by
, negatively affects the performance and therefore should be avoided. From a numerical point of view, the order of magnitude of the values of the features
and
is
, for
and
is
, for
is
, and eventually, for
is
. Because
and
are on different sides of the inequality, and because
values are negligible, the decision mainly depends on the relationship between
and the difference between
and
. In the case of PD patients, the tremor, measured by
, must be larger than the maximum distance between the template and the written trace, measured by
, minus the standard deviation of the written trace radius, expressed by
. This algebraic relationship may be linked to signs clinically associated to Parkinson’s disease by noting that the higher the values of
and
, the larger the difference between the template and the written trace, and therefore their difference can be used to characterize the skill of the subject in following the template. Thus, the model learned by the CGP suggests that, for diagnostic purpose, the tremor plays a crucial role, but in relation to the subject skill in tracing the reference pattern. Therefore, a large value of the tremor by itself does not suffice for a PD diagnosis, as there may be subject exhibiting high values of
not because of the pathology, but because of their poor motor skills, which leads to larger values for
,
, and
. Eventually, it is interesting to note that, in comparison with the best model discovered by the CGP, the simplest model (Algorithm 3) completely gets rid of the feature
, which is perfectly plausible once the numerical values are considered.