1. Introduction
Neurodegenerative disorders affect the structure and functions of brain regions resulting in a progressive cognitive, functional and behavioral decline. Among them, Parkinson’s disease (PD) is one of the most common and most disabling. It is mainly characterized by motor symptoms, including akinesia, bradykinesia, rigidity and tremor, as well as non-motor deficits, such as depression, sleep disorders and dementia [
1]. Today, there is no cure and a precise diagnosis is possible only
post-mortem. Nevertheless, an early diagnosis of PD would be crucial in the perspective of the proper medical treatment to be administered and for evaluating the effectiveness of new drug treatments at prodromal stages.
A growing interest has thus arisen in the scientific community in e-health, particularly in computer aided diagnosis (CAD) systems. Such systems, in fact, have the potential to assist clinicians at the point of care, providing novel diagnostic tools, while reducing the expenditure of public health care. Of course, CAD systems are not expected to replace standard techniques, but to provide complementary approaches to standard evaluations that are non-invasive and very low-cost.
As handwriting difficulties in PD patients have been documented since a long time, a promising
biomarker concerns the changes in handwriting due to the concomitant impairment [
2,
3]. Handwriting, in fact, is a complex activity which involves several aspects including fine motor control, eye-hand coordination, visuo-spatial abilities, and so on [
4]. Evidence exists about the effectiveness of machine learning approaches aimed at discriminating between unhealthy and healthy subjects based on simple and easy-to-perform handwriting tasks, e.g., [
5,
6,
7].
The most employed approach to studying the diagnostic potential of handwriting tasks consists in exploiting
dynamic features of the handwriting process. This approach relies on the analysis of time series data characterizing handwriting, which can be acquired through the use of digitizing tablets provided with electronic pens (see, for example, [
8]). Such a device enables the collection of the geometric position of the pen at certain time stamps, as well as the pressure exerted over the writing surface, pen inclination, and if the movement of the pen is performed “in the air”.
Some recent studies adopted dynamic handwriting analysis for the purpose of PD classification showing encouraging results, e.g., [
9,
10]. However, all of them focused on the binary discrimination healthy/unhealthy independently of the degree of disease severity shown by the PD cohort. In other words, the Parkinsonian sample is considered as a single cluster in which all subjects share the same degree of severity. The aim of this paper is to investigate if and to which extent dynamic features of the handwriting process can help discriminate people suffering from PD at earlier stages.
To this end, a freely available dataset, i.e., PaHaW [
11], has been used, as it includes several patients at different degrees of disease severity. Dynamic features have then been extracted from the handwriting tasks performed by these subjects. Finally, a classification framework has been developed, based on both the analysis of the overall feature vector resulting from all tasks and the analysis of each task taken individually.
The rest of this paper is organized as follows.
Section 2 describes the data used for the present study.
Section 3 focuses on the experimental set up.
Section 4 analyzes the results obtained.
Section 5 concludes the work.
Author Contributions
Conceptualization, D.I., G.P. and G.V.; Formal Analysis, G.V.; Funding Acquisition, D.I. and G.P.; Investigation, D.I., G.P. and G.V.; Methodology, D.I., G.P. and G.V.; Project Administration, D.I. and G.P.; Software, G.V.; Supervision, G.P.; Validation, D.I., G.P. and G.V.; Writing—Original Draft, G.V.
Funding
This research was funded by the Italian Ministry of Education, University and Research within the PRIN2015-HAND Project under Grant H96J16000820001.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Lang, A.E.; Lozano, A.M. Parkinson’s disease. N. Engl. J. Med. 1998, 339, 1130–1143. [Google Scholar] [CrossRef] [PubMed]
- De Stefano, C.; Fontanella, F.; Impedovo, D.; Pirlo, G.; di Freca, A.S. Handwriting analysis to support neurodegenerative diseases diagnosis: A review. Pattern Recognit. Lett. 2018, in press. [Google Scholar] [CrossRef]
- Impedovo, D.; Pirlo, G. Dynamic handwriting analysis for the assessment of neurodegenerative diseases: A pattern recognition perspective. IEEE Rev. Biomed. Eng. 2018, in press. [Google Scholar] [CrossRef] [PubMed]
- Tseng, M.H.; Cermak, S.A. The influence of ergonomic factors and perceptual–motor abilities on handwriting performance. Am. J. Occup. Ther. 1993, 47, 919–926. [Google Scholar] [CrossRef] [PubMed]
- Rosenblum, S.; Samuel, M.; Zlotnik, S.; Erikh, I.; Schlesinger, I. Handwriting as an objective tool for Parkinson’s disease diagnosis. J. Neurol. 2013, 260, 2357–2361. [Google Scholar] [CrossRef] [PubMed]
- Kotsavasiloglou, C.; Kostikis, N.; Hristu-Varsakelis, D.; Arnaoutoglou, M. Machine learning-based classification of simple drawing movements in Parkinson’s disease. Biomed. Signal Process. Control 2017, 31, 174–180. [Google Scholar] [CrossRef]
- Pereira, C.R.; Pereira, D.R.; Rosa, G.H.; Albuquerque, V.H.; Weber, S.A.; Hook, C.; Papa, J.P. Handwritten dynamics assessment through convolutional neural networks: An application to Parkinson’s disease identification. Artif. Intell. Med. 2018, 87, 67–77. [Google Scholar] [CrossRef] [PubMed]
- Sybenga, S.; Rybarczyk, Y. Using machine learning and image processing for character recognition: An application for teaching handwriting. In Proceedings of the 28th International Conference on Computer Applications in Industry and Engineering, San Diego, CA, USA, 12–14 October 2015. [Google Scholar]
- Drotár, P.; Mekyska, J.; Rektorová, I.; Masarová, L.; Smékal, Z.; Faundez-Zanuy, M. Decision support framework for Parkinson’s disease based on novel handwriting markers. IEEE Trans. Neural Syst. Rehabil. Eng. 2015, 23, 508–516. [Google Scholar] [CrossRef] [PubMed]
- Zham, P.; Arjunan, S.; Raghav, S.; Kumar, D.K. Efficacy of guided spiral drawing in the classification of Parkinson’s Disease. IEEE J. Biomed. Health Inform. 2017, 5, 1648–1652. [Google Scholar] [CrossRef] [PubMed]
- Drotár, P.; Mekyska, J.; Rektorová, I.; Masarová, L.; Smékal, Z.; Faundez-Zanuy, M. Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson’s disease. Artif. Intell. Med. 2016, 67, 39–46. [Google Scholar] [CrossRef] [PubMed]
- Goetz, C.G.; Tilley, B.C.; Shaftman, S.R.; Stebbins, G.T.; Fahn, S.; Martinez-Martin, P.; Poewe, W.; Sampaio, C.; Stern, M.B.; Dodel, R.; et al. Movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): Scale presentation and clinimetric testing results. Mov. Disord. Off. J. Mov. Disord. Soc. 2008, 23, 2129–2170. [Google Scholar] [CrossRef] [PubMed]
- Goetz, C.G.; Poewe, W.; Rascol, O.; Sampaio, C.; Stebbins, G.T.; Counsell, C.; Giladi, N.; Holloway, R.G.; Moore, C.G.; Wenning, G.K.; et al. Movement disorder society task force report on the hoehn and yahr staging scale: Status and recommendations the movement disorder society task force on rating scales for Parkinson’s disease. Mov. Disord. 2004, 19, 1020–1028. [Google Scholar] [CrossRef] [PubMed]
- Teulings, H.L.; Stelmach, G.E. Control of stroke size, peak acceleration, and stroke duration in Parkinsonian handwriting. Hum. Mov. Sci. 1991, 10, 315–334. [Google Scholar] [CrossRef]
- Van Gemmert, A.; Teulings, H.L.; Contreras-Vidal, J.L.; Stelmach, G. Parkinson’s disease and the control of size and speed in handwriting. Neuropsychologia 1999, 37, 685–694. [Google Scholar] [CrossRef]
- Broderick, M.P.; Van Gemmert, A.W.; Shill, H.A.; Stelmach, G.E. Hypometria and bradykinesia during drawing movements in individuals with Parkinson’s disease. Exp. Brain Res. 2009, 197, 223–233. [Google Scholar] [CrossRef] [PubMed]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: Berlin, Germany, 2009. [Google Scholar]
- Vapnik, V. The Nature of Statistical Learning Theory; Springer Science & Business Media: Berlin, Germany, 2013. [Google Scholar]
- Breiman, L. Random forests. In Machine Learning; Springer: Berlin, Germany, 2001; Volume 45, pp. 5–32. [Google Scholar]
- Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
- Rokach, L. Ensemble-based classifiers. Artif. Intell. Rev. 2010, 33, 1–39. [Google Scholar] [CrossRef]
- Guyon, I.; Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 2003, 3, 1157–1182. [Google Scholar]
- Bidet-Ildei, C.; Pollak, P.; Kandel, S.; Fraix, V.; Orliaguet, J.P. Handwriting in patients with Parkinson disease: Effect of L-dopa and stimulation of the sub-thalamic nucleus on motor anticipation. Hum. Mov. Sci. 2011, 30, 783–791. [Google Scholar] [CrossRef] [PubMed]
- Pullman, S.L. Spiral analysis: A new technique for measuring tremor with a digitizing tablet. Mov. Disord. 1998, 13, 85–89. [Google Scholar] [CrossRef] [PubMed]
- Moetesum, M.; Siddiqi, I.; Vincent, N.; Cloppet, F. Assessing visual attributes of handwriting for prediction of neurological disorders—A case study on Parkinson’s disease. Pattern Recognit. Lett. 2018, in press. [Google Scholar] [CrossRef]
- Lacquaniti, F.; Terzuolo, C.; Viviani, P. The law relating the kinematic and figural aspects of drawing movements. Acta Psychol. 1983, 54, 115–130. [Google Scholar] [CrossRef]
- Rybarczyk, Y.; Carvalho, D. Effect of the implementation of the two-third power law in teleoperation. In Advances in Human Factors and System Interactions; Springer: Berlin, Germany, 2017; pp. 283–292. [Google Scholar]
Table 1.
PaHaW dataset. A line divides the Parkinson’s disease (PD) group into two subgroups depending on the disease severity: early to mild vs. mild to severe. In the present work, only the healthy control (HC) group and the early to mild subgroup have been taken into account.
| Age | Disease Duration | LED | M/F | Stage |
---|
Mean | Std | Mean | Std | Mean | Std |
---|
HC | 62.5 | 11.5 | – | – | – | – | 19/17 | – |
PD | 62.2 | 9.7 | 3.8 | 1.1 | 986.2 | 611.2 | 3/2 | 1 |
– | – | – | – | – | – | – | 1.5 |
70.6 | 12.0 | 7.0 | 3.4 | 1385.1 | 634.9 | 6/12 | 2 |
68.0 | 7.9 | 12.0 | 4.8 | 1674.4 | 478.1 | 4/2 | 2.5 |
69.0 | 7.2 | 12.5 | 4.1 | 1349.9 | 610.8 | 3/1 | 3 |
78.5 | 8.5 | 9.5 | 2.5 | 1383.3 | 66.6 | 2/0 | 4 |
75 | – | 18 | – | 1370.0 | – | 0/1 | 5 |
Table 2.
List of features. Unless otherwise specified, they are intended both on-surface and in-air.
Feature | s/v | Description |
---|
Stroke number | s | Number of strokes |
Displacement | v | Tangential trajectory during handwriting |
Velocity | v | Rate of change of position whit respect to time |
Acceleration | v | Rate of change of velocity with respect to time |
Jerk | v | Rate of change of acceleration with respect to time |
Horizontal/vertical displacement | v | Displacement in the horizontal/vertical direction |
Horizontal/vertical velocity | v | Velocity in the horizontal/vertical direction |
Horizontal/vertical acceleration | v | Acceleration in the horizontal/vertical direction |
Horizontal/vertical jerk | v | Jerk in the horizontal/vertical direction |
NCV | s | Mean number of local extrema of velocity |
NCA | s | Mean number of local extrema of acceleration |
Relative NCV | s | NCV relative to writing duration |
Relative NCA | s | NCA relative to writing duration |
Stroke size | v | Path length of each stroke |
Stroke duration | v | Movement time per stroke |
Speed | s | Trajectory during handwriting divided by writing duration |
Stroke speed | v | Trajectory during stroke divided by stroke duration |
Stroke height | v | Height of each stroke |
Stroke width | v | Width of each stroke |
On-surface time | s | Overall time spent on-surface |
In-air time | s | Overall time spent in-air |
Total time | s | On-surface time plus in-air time |
Normalized on-surface time | s | On-surface time normalized by total time |
Normalized in-air time | s | In-air time normalized by total time |
In-air/on-surface ratio | s | Ratio of time spent in-air/on-surface |
Mean pressure | v | Average pressure over all on-surface strokes |
NCP | s | Mean number of local extrema of pressure |
Relative NCP | s | NCP relative to writing duration |
Horizontal/vertical Shannon entropy | v | Shannon entropy of the horizontal/vertical component of the pen position |
Horizontal/vertical Rényi entropy | v | Second and third order Rényi entropy of the horizontal/vertical component of the pen position |
Horizontal/vertical signal-to-noise ratio | v | Signal-to-noise ratio of the horizontal/vertical component of the pen position |
Horizontal/vertical intrinsic Shannon entropy | v | Shannon entropy of the first/second IMF obtained by the EMD of the horizontal/vertical component of the pen position |
Horizontal/vertical intrinsic Rényi entropy | v | Second and third order Rényi entropy of the first/second IMF obtained by the EMD of the horizontal/vertical component of the pen position |
Horizontal/vertical signal-to-noise ratio | v | Signal-to-noise ratio of the first/second IMF obtained by the EMD of the horizontal/vertical component of the pen position |
Table 3.
Classification performance with features merged from all tasks. The best results are in bold.
Classifier | Accuracy | AUC | Sensitivity | Specificity |
---|
KNN | 67.90% | 72.22% | 48.28% | 83.33% |
SVM-RBF | 71.33% | 73.89% | 55.17% | 83.33% |
SVM-linear | 68.24% | 68.33% | 58.62% | 75.00% |
NB | 57.29% | 68.75% | 17.24% | 88.89% |
LDA | 66.81% | 70.83% | 58.62% | 72.22% |
RF | 73.38% | 75.00% | 62.07% | 83.33% |
ADA | 61.81% | 69.71% | 48.28% | 72.22% |
Table 4.
Confusion matrix for the RF classifier.
| HC (Predicted) | PD (Predicted) |
---|
HC (true) | 27 | 7 |
PD (true) | 11 | 18 |
Table 5.
Accuracy performance task by task. The best results per task are in bold; the best three tasks overall are in italics.
Task | KNN | SVM-RBF | SVM-lin. | NB | LDA | RF | ADA |
---|
(1) Spiral | 48.85% | 53.69% | 50.67% | 54.67% | 49.23% | 51.95% | 53.00% |
(2) l l l | 57.52% | 57.28% | 57.19% | 56.61% | 56.09% | 56.38% | 61.80% |
(3) le le le | 59.09% | 61.90% | 72.28% | 56.71% | 66.57% | 62.67% | 61.47% |
(4) les les les | 40.76% | 47.42% | 50.38% | 55.28% | 48.38% | 51.95% | 47.80% |
(5) lektorka | 51.76% | 45.57% | 49.23% | 49.57% | 47.57% | 45.04% | 59.80% |
(6) porovnat | 52.19% | 61.80% | 62.00% | 44.23% | 63.71% | 56.14% | 60.33% |
(7) nepopadnout | 47.57% | 46.14% | 54.80% | 45.80% | 52.19% | 59.09% | 60.28% |
(8) Sentence task | 58.28% | 71.09% | 59.23% | 71.95% | 64.23% | 66.85% | 59.47% |
Table 6.
Classification performance of the ensemble of tasks. The best results are in bold.
Ensemble Scheme | Accuracy | AUC | Sensitivity | Specificity |
---|
All tasks | 69.52% | 83.06% | 31.03% | 100.00% |
Three best tasks (task 3, 6 and 8) | 74.76% | 82.78% | 68.97% | 77.78% |
Table 7.
Confusion matrix for the ensemble of all tasks.
| HC (Predicted) | PD (Predicted) |
---|
HC (true) | 36 | 0 |
PD (true) | 20 | 9 |
Table 8.
Confusion matrix for the ensemble of the best three tasks.
| HC (Predicted) | PD (Predicted) |
---|
HC (true) | 28 | 8 |
PD (true) | 9 | 10 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).