A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition
Abstract
:1. Introduction
2. Dynamic Time Warping
2.1. Dynamic Time Warping for Gesture Recognition
2.2. Weighted DTW for Multiple Sensors
3. Differential Evolution to Optimize the Weights of DTW
4. Experimental Results
4.1. Experimental Simulation for Bowling Action
4.2. Experimental Results of the Adjusted Weights by Differential Evolution (α = 0.1)
4.3. Experimental Results of the Adjusted Weights by Differential Evolution (α = 1)
5. Conclusions/Recommendations
Author Contributions
Acknowledgments
Conflicts of Interest
References
- Mitra, S.; Acharya, T. Gesture recognition: A survey. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 2007, 37, 311–324. [Google Scholar] [CrossRef]
- Pavlovic, V.I.; Sharma, R.; Huang, T.S. Visual interpretation of hand gestures for human-computer interaction: A review. IEEE Trans. Pattern Anal. Mach. Intell. 1997, 19, 677–695. [Google Scholar] [CrossRef]
- Mortensen, E.N.; Deng, H.; Shapiro, L. A SIFT descriptor with global context. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, CA, USA, 20–26 June 2005; Volume 1, pp. 184–190. [Google Scholar]
- Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, CA, USA, 20–26 June 2005; Volume 1, pp. 886–893. [Google Scholar]
- Ganapathi, V.; Plagemann, C.; Koller, D.; Thrun, S. Real time motion capture using a single time-of-flight camera. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, 13–18 June 2010; pp. 755–762. [Google Scholar]
- Jain, H.P.; Subramanian, A.; Das, S.; Mittal, A. Real-time upper-body human pose estimation using a depth camera. In Proceedings of the International Conference on Computer Vision/Computer Graphics Collaboration Techniques and Applications, Rocquencourt, France, 10–11 October 2011; pp. 227–238. [Google Scholar]
- Suarez, J.; Murphy, R.R. Hand gesture recognition with depth images: A review. In Proceedings of the 2012 IEEE RO-MAN, Paris, France, 9–13 September 2012; pp. 411–417. [Google Scholar]
- Kern, N.; Schiele, B.; Schmidt, A. Multi-sensor activity context detection for wearable computing. In Proceedings of the European Symposium on Ambient Intelligence, Veldhoven, The Netherlands, 3–4 November 2003; pp. 220–232. [Google Scholar]
- Kamijoh, N.; Inoue, T.; Olsen, C.M.; Raghunath, M.T.; Narayanaswami, C. Energy trade-offs in the IBM wristwatch computer. In Proceedings of the Fifth International Symposium on Wearable Computers, Zurich, Switzerland, 8–9 October 2001; pp. 133–140. [Google Scholar]
- Chen, C.; Jafari, R.; Kehtarnavaz, N. A survey of depth and inertial sensor fusion for human action recognition. Multimed. Tools Appl. 2017, 76, 4405–4425. [Google Scholar] [CrossRef]
- Sakoe, H.; Chiba, S. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 1978, 26, 43–49. [Google Scholar] [CrossRef]
- Lee, H.K.; Kim, J.H. An HMM-based threshold model approach for gesture recognition. IEEE Trans. Pattern Anal. Mach. Intell. 1999, 21, 961–973. [Google Scholar] [Green Version]
- Sminchisescu, C.; Kanaujia, A.; Metaxas, D. Conditional models for contextual human motion recognition. Comput. Vis. Image Underst. 2006, 104, 210–220. [Google Scholar] [CrossRef] [Green Version]
- Wang, P.; Li, W.; Gao, Z.; Zhang, J.; Tang, C.; Ogunbona, P.O. Action recognition from depth maps using deep convolutional neural networks. IEEE Trans. Hum.-Mach. Syst. 2016, 46, 498–509. [Google Scholar] [CrossRef]
- Chun, Q.; Zhang, E. Human action recognition based on improved motion history image and deep convolutional neural networks. In Proceedings of the 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), Shanghai, China, 14–16 October 2017; pp. 1–5. [Google Scholar]
- Li, C.; Hou, Y.; Wang, P.; Li, W. Joint distance maps based action recognition with convolutional neural networks. IEEE Signal Process. Lett. 2017, 24, 624–628. [Google Scholar] [CrossRef]
- Hou, Y.; Li, Z.; Wang, P.; Li, W. Skeleton optical spectra based action recognition using convolutional neural networks. IEEE Trans. Circuits Syst. Video Technol. 2018, 28, 807–811. [Google Scholar] [CrossRef]
- Choi, H.R.; Kim, T. Combined dynamic time warping with multiple sensors for 3D gesture recognition. Sensors 2017, 17, 1893. [Google Scholar] [CrossRef] [PubMed]
- Fuad, M.M.M. Differential evolution-based weighted combination of distance metrics for k-means clustering. In Proceedings of the International Conference on Theory and Practice of Natural Computing, Granada, Spain, 9–11 December 2014; Springer: Cham, Switzerland, 2014; pp. 193–204. [Google Scholar]
- Islam, S.M.; Das, S.; Ghosh, S.; Roy, S.; Suganthan, P.N. An adaptive differential evolution algorithm with novel mutation and crossover strategies for global numerical optimization. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2012, 42, 482–500. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z. Microsoft kinect sensor and its effect. IEEE Multimed. 2012, 19, 4–10. [Google Scholar] [CrossRef]
- Das, S.; Suganthan, P.N. Differential evolution: A survey of the state-of-the-art. IEEE Trans. Evol. Comput. 2011, 15, 4–31. [Google Scholar] [CrossRef]
- Mandal, S.K.; Mahanti, G.K.; Ghatak, R. Differential evolution algorithm for optimizing the conflicting parameters in time-modulated linear array antennas. Prog. Electromagn. Res. B 2013, 51, 101–118. [Google Scholar] [CrossRef]
- Bochinski, E.; Senst, T.; Sikora, T. Hyper-parameter optimization for convolutional neural network committees based on evolutionary algorithms. In Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 17–20 September 2017; IEEE: Beijing, China, 2017; pp. 3924–3928. [Google Scholar]
- Rocca, P.; Oliveri, G.; Massa, A. Differential evolution as applied to electromagnetics. IEEE Antennas Propag. Mag. 2011, 53, 38–49. [Google Scholar] [CrossRef]
- Oliveri, G.; Rocca, P.; Massa, A. Differential evolution as applied to electromagnetics: Advances, comparisons, and applications. In Proceedings of the 2012 6th European Conference on Antennas and Propagation (EUCAP), Prague, Czech Republic, 26–30 March 2012; IEEE: Prague, Czech Republic, 2012; pp. 3058–3059. [Google Scholar]
- Chen, C.; Jafari, R.; Kehtarnavaz, N. Utd-mhad: A multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 168–172. [Google Scholar]
S/N | Action |
---|---|
1 | Right arm swipe to the left |
2 | Right arm swipe to the right |
3 | Right hand wave |
4 | Two hand front clap |
5 | Right arm throw |
6 | Cross arms in the chest |
7 | Basketball shoot |
8 | Right hand draw x |
9 | Right hand draw circle (clockwise) |
10 | Right hand draw circle (counter clockwise) |
11 | Draw triangle |
12 | Bowling (right hand) |
13 | Front boxing |
14 | Baseball swing from right |
15 | Tennis right hand forehand swing |
16 | Arm curl (two arms) |
17 | Tennis serve |
18 | Two hand push |
19 | Right hand knock on the door |
20 | Right hand catch an object |
21 | Right hand pick up and throw |
22 | Jogging in place |
23 | Walking in place |
24 | Sit to stand |
25 | Stand to sit |
26 | Forward lunge (left foot forward) |
27 | Squat (two arms stretched out) |
Technic Used | Accuracy of Recognition | Characteristics |
---|---|---|
Joint Trajectory Map | 89.81% | Accumulating the user’s joints trajectory, Based on CNN [17]. |
Joint Distance Maps | 88.1% | The distance between the joints of the user’s expression, based on CNN [16]. |
Motion History Map | 84.00% | Cumulative recording of the user’s movement by the use of an image, CNN-based [15]. |
Depth Motion Map | 79.10% | Human action recognition using a depth camera and a wearable inertial Sensor. Data fusion from multi-sensors [27]. |
Proposed Approach | 99.40% | Differential evolution to optimize weights. DTW Based. |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Rwigema, J.; Choi, H.-R.; Kim, T. A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition. Sensors 2019, 19, 1007. https://doi.org/10.3390/s19051007
Rwigema J, Choi H-R, Kim T. A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition. Sensors. 2019; 19(5):1007. https://doi.org/10.3390/s19051007
Chicago/Turabian StyleRwigema, James, Hyo-Rim Choi, and TaeYong Kim. 2019. "A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition" Sensors 19, no. 5: 1007. https://doi.org/10.3390/s19051007
APA StyleRwigema, J., Choi, H.-R., & Kim, T. (2019). A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition. Sensors, 19(5), 1007. https://doi.org/10.3390/s19051007