Improving the Performance and Explainability of Indoor Human Activity Recognition in the Internet of Things Environment
Abstract
1. Introduction
2. Related Work
3. Proposed Method
3.1. Problem Definition
3.2. Proposed Approach
3.3. Formal Definition
| Algorithm 1. Human Activity Recognition on Signal Images (HARSI) | 
| Inputs:D = {(x1, y1, z1, a1), (x2, y2, z2, a2),…., (xn, yn, zn, an)} q: window size T: Test set Output: O = {o1, o2, …, ot} a set of outputs for test images Begin: for i = 0 to n/q do W = Ø for j = i *q to i * q + q − 1 do W = W U (xj, yj, zj,) activity = aj end for image = ConvertToImage(W) I = I U <image, activity> end for M = CNN(I) foreach image i in T do o = Classify(M, i) O = O U o end foreach Return O End | 
4. Experimental Studies
4.1. Dataset Description
4.2. Comparison of Different CNN Architectures
4.3. Comparison with the Classical Machine Learning Methods
4.4. Comparison with the State-of-the-Art Methods
4.5. Discussion
- In HAR, the ideal input data format is still a subject of much debate and there are various ongoing works for improving the accuracy of the models. Traditional HAR has been defined as a time-series data classification problem and requires feature extraction. In contrast, we transfer time-series data into signal images that reflect the properties of human activities. It avoids the need to perform an explicit feature generation and selection stage. We improved accuracy by working on signal image data, instead of numerical time-series data.
- Many applications in HAR [33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50] have used classical machine learning methods such as DT, SVM, MLP, NB, LR, KNN, and RF. However, the performance of these methods is still highly debated. In this study, we take advantage of the strengths of deep learning approaches.
- Another debate is how to design CNN architecture to be able to obtain good performance. For example, the number of layers and parameter settings are still subjects of much debate. In this study, we compared nine different CNN architectures to determine the best suitable one.
- In the activity recognition community, there is an open debate on providing explainability in the HAR systems. The main problem is how to increase the transparency and interpretability of the models. In this study, to increase human-level explainability, we visualize the data with charts since generating signal images makes data understandable for humans.
- Another ongoing debate is which activities can be predicted more precisely. This study showed that the best accuracies were achieved in the sitting, standing, and jogging activities due to their diverse natures.
- The proposed HAR model can be connected to many different fields of study such as health monitoring, fitness tracking, home and work automation, and self-managing system. With the rapid technological developments in smartphones, the model can enable new opportunities for developing informative systems on a large scale to perceive and act on what users (i.e., your children, elderly mother, or sick family member) are doing. Recognizing human activities is important for the treatment of patients and can provide useful feedback to the clinicians since the activity is associated with health. For example, it can be used to monitor patients in rehabilitation since the functional status of a person is an important parameter in this area. In addition, it could be used to offer activity-aware services to smartphone users, such as movement recommendations. A number of lifestyle diseases and movement disorders are associated with inactivity; therefore, our model can be used to give information to prevent diseases. The users can participate in the tracking of their activities for the sake of health, fitness, or other purposes due to its strength in providing personalized support.
5. Conclusions and Future Works
- The proposed approach improves human-level explainability for smart sensor data by using signal images in the field of HAR.
- The proposed HARSI approach improves the recognition accuracy in the HAR problems by converting time-series data to image data.
- The experimental results showed that HARSI successfully (98%) recognized six symmetric human activities, including walking, jogging, standing, sitting, moving downstairs, and moving upstairs.
- According to the experimental results, it can be concluded that the best suitable and consistent CNN model for the WISDM dataset is VGG19. It achieved the best results on all the metrics (accuracy, precision, recall, and f-measure). Therefore, this model can be successfully used to identify human activities.
- The prediction accuracy changes according to human activities. Among the activities, sitting, standing, and jogging were correctly predicted by the proposed method. On the other hand, the model had a little difficulty in classifying downstairs and upstairs activities with an accuracy of 95% for the WISDM dataset.
- The number of layers and number of parameters of a CNN model may be associated with computational complexity, where the higher the number of layers and parameters, the greater the computational load during the training process.
- A significant improvement (13.72% on average) was achieved by the proposed HARSI model compared to the classical machine learning methods such as KNN, DT, SVM, NB, LR, MLP, AdaBoost, and RF.
- Our approach achieved higher classification accuracy than the state-of-the-art approaches. It outperformed them by 7.06% on average on the same dataset.
- The proposed HARSI approach has the potential to expand the application of machine learning in many different sectors, thanks to its advantages.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Hassan, M.M.; Ullah, S.; Hossain, M.S.; Alelaiwi, A. An end-to-end deep learning model for human activity recognition from highly sparse body sensor data in internet of medical things environment. J. Supercomput. 2021, 77, 2237–2250. [Google Scholar] [CrossRef]
- Kanjilal, R.; Uysal, I. The future of human activity recognition: Deep learning or feature engineering? Neural Process. Lett. 2021, 53, 561–579. [Google Scholar] [CrossRef]
- Mekruksavanich, S.; Jitpattanakul, A. Biometric user identification based on human activity recognition using wearable sensors: An experiment using deep learning models. Electronics 2021, 10, 308. [Google Scholar] [CrossRef]
- Mihoub, A. A deep learning-based framework for human activity recognition in smart homes. Mob. Inf. Syst. 2021, 11, 6961343. [Google Scholar] [CrossRef]
- Mohsen, S.; Elkaseer, A.; Scholz, S.G. Industry 4.0-oriented deep learning models for human activity recognition. IEEE Access 2021, 9, 150508–150521. [Google Scholar] [CrossRef]
- Madokoro, H.; Nix, S.; Woo, H.; Sato, K. A mini-survey and feasibility study of deep-learning-based human activity recognition from slight feature signals obtained using privacy-aware environmental sensors. Appl. Sci. 2021, 11, 11807. [Google Scholar] [CrossRef]
- Casilari, E.; Álvarez-Marco, M.; García-Lagos, F. A study of the use of gyroscope measurements in wearable fall detection systems. Symmetry 2020, 12, 649. [Google Scholar] [CrossRef]
- Shalaby, E.; ElShennawy, N.; Sarhan, A. Utilizing deep learning models in CSI-based human activity recognition. Neural Comput. Appl. 2022, 34, 5993–6010. [Google Scholar] [CrossRef]
- Bijalwan, V.; Semwal, V.B.; Gupta, V. Wearable sensor-based pattern mining for human activity recognition: Deep learning approach. Ind. Robot-Int. J. Robot Res. Appl. 2021, 49, 21–33. [Google Scholar] [CrossRef]
- Ferrari, A.; Micucci, D.; Mobilio, M.; Napoletano, P. Deep learning and model personalization in sensor-based human activity recognition. J. Reliab. Intell. Environ. 2022, 2022, 1–13. [Google Scholar] [CrossRef]
- Manjarrés, J.; Lan, G.; Gorlatova, M.; Hassan, M.; Pardo, M. Deep learning for detecting human activities from piezoelectric-based kinetic energy signals. IEEE Internet Things J. 2022, 9, 7545–7558. [Google Scholar] [CrossRef]
- Bhat, O.; Khan, D.A. Evaluation of deep learning model for human activity recognition. Evol. Syst. 2022, 13, 159–168. [Google Scholar] [CrossRef]
- Bozkurt, F. A comparative study on classifying human activities using classical machine and deep learning methods. Arab. J. Sci. Eng. 2022, 47, 1507–1521. [Google Scholar] [CrossRef]
- Khan, A.R.; Saba, T.; Khan, M.Z.; Fati, S.M.; Khan, M.U.G. Classification of human’s activities from gesture recognition in live videos using deep learning. Concurr. Computat. Pract. Exper. 2022, 34, e6825. [Google Scholar] [CrossRef]
- Khan, I.U.; Afzal, S.; Lee, J.W. Human activity recognition via hybrid deep learning based model. Sensors 2022, 22, 323. [Google Scholar] [CrossRef]
- Tasnim, N.; Islam, M.K.; Baek, J.-H. Deep learning based human activity recognition using spatio-temporal image formation of skeleton joints. Appl. Sci. 2021, 11, 2675. [Google Scholar] [CrossRef]
- Maitre, J.; Bouchard, K.; Gaboury, S. Alternative deep learning architectures for feature-level fusion in human activity recognition. Mob. Netw. Appl. 2021, 26, 2076–2086. [Google Scholar] [CrossRef]
- Hwang, Y.M.; Park, S.; Lee, H.O.; Ko, S.-K.; Lee, B.-T. Deep Learning for human activity recognition based on causality feature extraction. IEEE Access 2021, 9, 112257–112275. [Google Scholar] [CrossRef]
- Ronald, M.; Poulose, A.; Han, D.S. iSPLInception: An inception-ResNet deep learning architecture for human activity recognition. IEEE Access 2021, 9, 68985–69001. [Google Scholar] [CrossRef]
- Pei, L.; Xia, S.; Chu, L.; Xiao, F.; Wu, Q.; Yu, W.; Qiu, R. MARS: Mixed virtual and real wearable sensors for human activity recognition with multidomain deep learning model. IEEE Internet Things J. 2021, 8, 9383–9396. [Google Scholar] [CrossRef]
- Yen, C.-T.; Liao, J.-X.; Huang, Y.-K. Feature fusion of a deep-learning algorithm into wearable sensor devices for human activity recognition. Sensors 2021, 21, 8294. [Google Scholar] [CrossRef]
- Irfan, S.; Anjum, N.; Masood, N.; Khattak, A.S.; Ramzan, N. A novel hybrid deep learning model for human activity recognition based on transitional activities. Sensors 2021, 21, 8227. [Google Scholar] [CrossRef]
- Al-Wesabi, F.N.; Albraikan, A.A.; Hilal, A.M.; Al-Shargabi, A.A.; Alhazbi, S.; Duhayyim, M.A.; Rizwanullah, M.; Hamza, M.A. Design of optimal deep learning based human activity recognition on sensor enabled internet of things environment. IEEE Access 2021, 9, 143988–143996. [Google Scholar] [CrossRef]
- Alawneh, L.; Alsarhan, T.; Al-Zinati, M.; Al-Ayyoub, M.; Jararweh, Y.; Lu, H. Enhancing human activity recognition using deep learning and time series augmented data. J. Ambient Intell. Humaniz. Comput. 2021, 12, 10565–10580. [Google Scholar] [CrossRef]
- Elsts, A.; McConvill, R. Are microcontrollers ready for deep learning-based human activity recognition? Electronics 2021, 10, 2640. [Google Scholar] [CrossRef]
- Ghate, V.; Hemalatha, S.C. Hybrid deep learning approaches for smartphone sensor-based human activity recognition. Multimed. Tools Appl. 2021, 80, 35585–35604. [Google Scholar] [CrossRef]
- Thakur, D.; Biswas, S. Feature fusion using deep learning for smartphone based human activity recognition. Int. J. Inf. Tecnol. 2021, 13, 1615–1624. [Google Scholar] [CrossRef]
- Buffelli, D.; Vandin, F. Attention-based deep learning framework for human activity recognition with user adaptation. IEEE Sens. J. 2021, 21, 13474–13483. [Google Scholar] [CrossRef]
- Alhersh, T.; Stuckenschmidt, H.; Rehman, A.U.; Belhaouari, S.B. Learning human activity from visual data using deep learning. IEEE Access 2021, 9, 106245–106253. [Google Scholar] [CrossRef]
- Chen, L.; Liu, X.; Peng, L.; Wu, M. Deep learning based multimodal complex human activity recognition using wearable devices. Appl. Intell. 2021, 51, 4029–4042. [Google Scholar] [CrossRef]
- Thu, N.H.T.; Han, D.S. HiHAR: A hierarchical hybrid deep learning architecture for wearable sensor-based human activity recognition. IEEE Access 2021, 9, 145271–145281. [Google Scholar] [CrossRef]
- Stuart, M.; Manic, M. Deep learning shared bandpass filters for resource-constrained human activity recognition. IEEE Access 2021, 9, 39089–39097. [Google Scholar] [CrossRef]
- Kwapisz, J.R.; Weiss, G.M.; Moore, S.A. Activity recognition using cell phone accelerometers. ACM SIGKDD Explor. Newsl. 2010, 12, 74–82. [Google Scholar] [CrossRef]
- Suwannarat, K.; Kurdthongmee, W. Optimization of deep neural network-based human activity recognition for a wearable device. Heliyon 2021, 7, e07797. [Google Scholar] [CrossRef]
- Vijayvargiya, A.; Kumari, N.; Gupta, P.; Kumar, R. Implementation of machine learning algorithms for human activity recognition. In Proceedings of the 3rd International Conference on Signal Processing and Communication (ICPSC), Coimbatore, India, 13–14 May 2021; pp. 440–444. [Google Scholar]
- Semwal, V.B.; Lalwani, P.; Mishra, M.K.; Bijalwan, V.; Chadha, J.S. An optimized feature selection using bio-geography optimization technique for human walking activities recognition. Computing 2021, 103, 2893–2914. [Google Scholar] [CrossRef]
- Kee, Y.J.; Zainudin, M.S.; Idris, M.I.; Ramlee, R.H.; Kamarudin, M.R. Activity recognition on subject independent using machine learning. Cybern. Inf. Technol. 2020, 20, 64–74. [Google Scholar] [CrossRef]
- Jalal, A.; Quaid, M.A.K.; Kim, K. A study of accelerometer and gyroscope measurements in physical life-log activities detection systems. Sensors 2020, 20, 6670. [Google Scholar] [CrossRef]
- Khare, S.; Sarkar, S.; Totaro, M. Comparison of sensor-based datasets for human activity recognition in wearable IoT. In Proceedings of the IEEE 6th World Forum on Internet of Things (WF-IoT), New Orleans, LA, USA, 2–16 June 2020; pp. 1–6. [Google Scholar]
- Arigbabu, O.A. Entropy decision fusion for smartphone sensor based human activity recognition. arXiv 2021, arXiv:2006.00367v1. [Google Scholar]
- Xu, H.; Li, J.; Yuan, H.; Liu, Q.; Fan, S.; Li, T.; Sun, X. Human activity recognition based on gramian angular field and deep convolutional neural network. IEEE Access 2022, 8, 199393–199405. [Google Scholar] [CrossRef]
- Lu, W.; Fan, F.; Chu, J.; Jing, P.; Yuting, S. Wearable computing for internet of things: A discriminant approach for human activity recognition. IEEE Internet Things J. 2019, 6, 2749–2759. [Google Scholar] [CrossRef]
- Ignatov, A. Real-time human activity recognition from accelerometer data using convolutional neural networks. Appl. Soft. Comput. 2018, 62, 915–922. [Google Scholar] [CrossRef]
- Xu, W.; Pang, Y.; Yang, Y.; Liu, Y. Human activity recognition based on convolutional neural network. In Proceedings of the 24th International Conference on Pattern Recognition, Beijing, China, 20–24 August 2018; pp. 165–170. [Google Scholar]
- Quispe, K.G.M.; Lima, W.S.; Batista, D.M.; Souto, E. MBOSS: A symbolic representation of human activity recognition using mobile sensors. Sensors 2018, 18, 4354. [Google Scholar] [CrossRef] [PubMed]
- Azmi, M.S.M.; Sulaiman, M.N. Accelerator-Based human activity recognition using voting technique with NBTREE and MLP classifiers. Int. J. Adv. Sci. Eng. Inf. Technol. 2017, 7, 146–152. [Google Scholar] [CrossRef]
- Walse, K.H.; Dharaskar, R.V.; Thakare, V.M. A study of human activity recognition using adaboost classifiers on WISDM dataset. Inst. Integr. Omics Appl. Biotechnol. J. 2016, 7, 68–76. [Google Scholar]
- Catal, C.; Tufekci, S.; Pirmit, E.; Kocabag, G. On the use of ensemble of classifiers for accelerometer-based activity recognition. Appl. Soft. Comput. 2015, 37, 1018–1022. [Google Scholar] [CrossRef]
- Zainudin, M.S.; Sulaiman, M.N.; Mustapha, N.; Perumal, T. Activity recognition based on accelerometer sensor using combinational classifiers. In Proceedings of the IEEE Conference on Open Systems (ICOS), Melaka, Malaysia, 24–26 August 2015; pp. 68–73. [Google Scholar]
- Kolosnjaji, B.; Eckert, C. Neural network-based user-independent physical activity recognition for mobile devices. In Lecture Notes in Computer Science; Jackowski, K., Burduk, R., Walkowiak, K., Wozniak, M., Yin, H., Eds.; Springer: Cham, Switzerland, 2015; pp. 378–386. [Google Scholar]
- Zhang, J.; Qiao, S.; Lin, Z.; Zhou, Y. Human activity recognition based on residual network. In Proceedings of the 8th Annual International Conference on Geo-Spatial Knowledge and Intelligence, Xian, China, 18–19 December 2020; pp. 1–6. [Google Scholar]
- Lin, S.B.; Wang, K.; Wang, Y.; Zhou, D.X. Universal consistency of deep convolutional neural networks. arXiv 2021, arXiv:2106.12498. [Google Scholar] [CrossRef]
- Zihao, Z.; Geng, J.; Jiang, W. A time series classification method based on 1DCNN-FNN. In Proceedings of the 33rd Chinese Control and Decision Conference (CCDC), Kunming, China, 22–24 May 2021; pp. 1566–1571. [Google Scholar]
- Garcia, K.D.; de Sá, C.R.; Poel, M.; Carvalho, T.; Mendes-Moreira, J.; Cardoso, J.M.; de Carvalho, A.C.P.L.F.; Kok, J.N. An ensemble of autonomous auto-encoders for human activity recognition. Neurocomputing 2021, 439, 271–280. [Google Scholar] [CrossRef]
- Lima, W.S.; Bragança, H.L.; Souto, E.J. NOHAR-NOvelty discrete data stream for human activity recognition based on smartphones with inertial sensors. Expert Syst. Appl. 2021, 166, 114093. [Google Scholar] [CrossRef]
- Sena, J.; Barreto, J.; Caetano, C.; Cramer, G.; Schwartz, W.R. Human activity recognition based on smartphone and wearable sensors using multiscale DCNN ensemble. Neurocomputing 2021, 444, 226–243. [Google Scholar] [CrossRef]
- Ramesh, A.K.; Gajjala, K.S.; Nakano, K.; Chakraborty, B. Person authentication by gait data from smartphone sensors using convolutional autoencoder. In Proceedings of the International Conference on Intelligence Science, Durgapur, India, 24–27 February 2021; pp. 149–158. [Google Scholar]
- Dhammi, L.; Tewari, P. Classification of human activities using data captured through a smartphone using deep learning techniques. In Proceedings of the 3rd International Conference on Signal Processing and Communication (ICPSC), Coimbatore, India, 13–14 May 2021; pp. 1–6. [Google Scholar]
- Wenzheng, Z. Human activity recognition based on acceleration sensor and neural network. In Proceedings of the 8th International Conference on Orange Technology (ICOT), Daegu, Korea, 18–21 December 2020; pp. 1–5. [Google Scholar]
- Peppas, K.; Tsolakis, A.C.; Krinidis, S.; Tzovaras, D. Real-time physical activity recognition on smart mobile devices using convolutional neural networks. Appl. Sci. 2020, 10, 8482. [Google Scholar] [CrossRef]
- Mehmood, K.; Imran, H.A.; Latif, U. HARDenseNet: A 1D DenseNet inspired convolutional neural network for human activity recognition with inertial sensors. In Proceedings of the 23rd International Multitopic Conference (INMIC), Bahawalpur, Pakistan, 5–7 November 2020; pp. 1–6. [Google Scholar]
- Aswal, V.; Sreeram, V.; Kuchik, A.; Ahuja, S.; Patel, H. Real-time human activity generation using bidirectional long short term memory networks. In Proceedings of the 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 13–15 May 2020; pp. 775–780. [Google Scholar]
- Xia, K.; Huang, J.; Wang, H. LSTM-CNN architecture for human activity recognition. IEEE Access 2020, 8, 56855–56866. [Google Scholar] [CrossRef]
- Agarwal, P.; Alam, M. A lightweight deep learning model for human activity recognition on edge devices. Procedia Comput. Sci. 2020, 167, 2364–2373. [Google Scholar] [CrossRef]
- Zhang, H.; Xiao, Z.; Wang, J.; Li, F.; Szczerbicki, E. A novel IoT-perceptive human activity recognition (HAR) approach using multihead convolutional attention. IEEE Internet Things J. 2019, 7, 1072–1080. [Google Scholar] [CrossRef]
- Huang, J.; Lin, S.; Wang, N.; Dai, G.; Xie, Y.; Zhou, J. TSE-CNN: A two-stage end-to-end CNN for human activity recognition. IEEE J. Biomed. Health Inf. 2019, 24, 292–299. [Google Scholar] [CrossRef] [PubMed]
- Mukherjee, D.; Mondal, R.; Singh, P.K.; Sarkar, R.; Bhattacharjee, D. EnsemConvNet: A deep learning approach for human activity recognition using smartphone sensors for healthcare applications. Multimed. Tools Appl. 2020, 79, 31663–31690. [Google Scholar] [CrossRef]
- Tang, Y.; Teng, Q.; Zhang, L.; Min, F.; He, J. Efficient convolutional neural networks with smaller filters for human activity recognition using wearable sensors. arXiv 2020, arXiv:2005.03948v1. [Google Scholar]
- Beirami, M.J.; Shojaedini, S.V. Residual network of residual network: A new deep learning modality to improve human activity recognition by using smart sensors exposed to unwanted shocks. J. Health Manag. Inf. 2020, 7, 228–239. [Google Scholar]
- Zhang, Y.; Zhang, Z.; Zhang, Y.; Bao, J.; Zhang, Y.; Deng, H. Human activity recognition based on motion sensor using U-Net. IEEE Access 2019, 7, 75213–75226. [Google Scholar] [CrossRef]
- Pienaar, S.W.; Malekian, R. Human activity recognition using LSTM-RNN deep neural network architecture. In Proceedings of the IEEE 2nd Wireless Africa Conference, Pretoria, South Africa, 18–20 August 2019; pp. 1–5. [Google Scholar]
- Manu, R.D.; Kumar, S.; Snehashish, S.; Rekha, K.S. Smart home automation using IoT and deep learning. Int. Res. J. Eng. Technol. 2019, 6, 1–4. [Google Scholar]
- Varamin, A.A.; Abbasnejad, E.; Shi, Q.; Ranasinghe, D.C.; Rezatofighi, H. Deep auto-set: A deep auto-encoder-set network for activity recognition using wearables. In Proceedings of the 15th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services, New York, NY, USA, 5–7 November 2018; pp. 246–253. [Google Scholar]
- Chandini, U. A Machine learning based activity recognition for ambient assisted living. Int. J. Future Revolut. Comput. Sci. Commun. Eng. 2018, 4, 323–326. [Google Scholar]
- Dungkaew, T.; Suksawatchon, J.; Suksawatchon, U. Impersonal smartphone-based activity recognition using the accelerometer sensory data. In Proceedings of the 2nd International Conference on Information Technology (INCIT), Nakhonpathom, Thailand, 2–3 November 2017; pp. 1–6. [Google Scholar]
- Chen, Y.; Zhong, K.; Zhang, J.; Sun, Q.; Zhao, X. LSTM networks for mobile human activity recognition. In Proceedings of the International Conference on Artificial Intelligence: Technologies and Applications, Bangkok, Thailand, 24–25 January 2016; pp. 50–53. [Google Scholar]
- Abdallah, Z.S.; Gaber, M.M.; Srinivasan, B.; Krishnaswamy, S. Adaptive mobile activity recognition system with evolving data streams. Neurocomputing 2015, 150, 304–317. [Google Scholar] [CrossRef]
- Mekruksavanich, S.; Jitpattanakul, A.; Youplao, P.; Yupapin, P. Enhanced hand-oriented activity recognition based on smartwatch sensor data using LSTMs. Symmetry 2020, 12, 1570. [Google Scholar] [CrossRef]
- Han, D.; Lee, C.; Kang, H. Gravity control-based data augmentation technique for improving VR user activity recognition. Symmetry 2021, 13, 845. [Google Scholar] [CrossRef]
- Su, J.-Y.; Cheng, S.-C.; Huang, D.-K. Unsupervised object modeling and segmentation with symmetry detection for human activity recognition. Symmetry 2015, 7, 427–449. [Google Scholar] [CrossRef]








| Ref | Year | Method | Description | Sensor Types | Data | Sensor Location | Number of Activities | Sensor-Data-Based | Signal-Image-Based | XAI | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNN | DNN | RNN | LSTM | AE | ||||||||||
| [8] | 2022 | √ | √ | √ | Channel state information (CSI) based HAR | Wireless signal | CSI | Room | 6 | √ | X | X | ||
| [9] | 2022 | √ | √ | √ | Gait pattern analysis | A, G, M | SG * | Center of mass | 7 | √ | X | X | ||
| [10] | 2022 | √ | Personalization in HAR | A, G, M | UniMiB SHAR | 17 | √ | X | X | |||||
| Motion Sense | 6 | |||||||||||||
| MobiAct | 15 | |||||||||||||
| [11] | 2022 | √ | √ | HAR from piezoelectric-based kinetic energy signals | KEH transducers | KEH | Hand, waist | 5 | √ | X | X | |||
| [12] | 2022 | √ | Feature extraction-based approach | A | WISDM | 6 | √ | X | X | |||||
| [13] | 2022 | √ | √ | √ | Comparative study on classifying human activities | A, G | UCI-HAR | Waist | 6 | √ | X | X | ||
| [14] | 2022 | √ | √ | Gesture recognition in videos | Camera | SG | Room | 4 | √ | X | X | |||
| [1] | 2021 | √ | HAR from highly sparse body sensor data | A, RFID | Roomset1 | Chest | 4 | √ | X | X | ||||
| Roomset2 | ||||||||||||||
| [2] | 2021 | √ | √ | √ | Feature extraction-based approach | A, G, M | UniMiB-SHAR | 17 | √ | X | X | |||
| [3] | 2021 | √ | √ | Biometric user identification | A, G | UCI-HAR | Waist | 6 | √ | X | X | |||
| USC-HAD | 12 | |||||||||||||
| [4] | 2021 | √ | √ | √ | HAR in smart homes | Env. sensors | Orange4Home | Room | 24 | √ | X | X | ||
| [5] | 2021 | √ | √ | Industry 4.0-oriented approach | A | WISDM | 6 | √ | X | X | ||||
| [6] | 2021 | √ | √ | Human pose and motion estimation | Camera | SG | Room | 5 | √ | X | X | |||
| Env. sensors | 8 | |||||||||||||
| [15] | 2021 | √ | √ | Hybrid deep-learning-based model | Motion Kinect sensor | SG | Room | 12 | √ | X | X | |||
| [16] | 2021 | √ | HAR using skeleton datasets | Camera | UTD-MHAD | Room | 27 | √ | X | X | ||||
| MSR-Action3D | 20 | |||||||||||||
| [17] | 2021 | √ | Feature fusion-based approach | A, G, M | MHEALTH | Ankle, arm, chest | 12 | √ | X | X | ||||
| [18] | 2021 | √ | √ | Causality feature extraction based approach | Env. sensors | Aruba | Room | 10 | √ | X | X | |||
| Milan | 15 | |||||||||||||
| Cairo | 13 | |||||||||||||
| [19] | 2021 | √ | √ | HAR based on the Inception-ResNet model | A, G | UCI-HAR | Waist | 6 | √ | X | X | |||
| Env. and body sensors | Opportunity | Room, body | 18 | |||||||||||
| A | Daphnet | Legs, hip | 2 | |||||||||||
| A, G, M | PAMAP2 | Chest, ankle | 18 | |||||||||||
| [20] | 2021 | √ | √ | Multiple domain DL framework | A, G, M | SG | Head, wrist, leg | 12 | √ | X | X | |||
| [21] | 2021 | √ | Feature-fusion-based approach | A, G | SG | Waist | 6 | √ | X | X | ||||
| UCI-HAR | ||||||||||||||
| [22] | 2021 | √ | √ | Recognizing transitional activities | A, G | HAPT | Waist | 12 | √ | X | X | |||
| HAD | 5 | |||||||||||||
| [23] | 2021 | √ | √ | Optimal deep-learning-based approach | A, G | UCI-HAR | Waist | 6 | √ | X | X | |||
| USC-HAD | 12 | |||||||||||||
| [24] | 2021 | √ | √ | HAR using time-series data | A | UniMiB SHAR | 17 | √ | X | X | ||||
| A | WISDM | 6 | ||||||||||||
| A | UCI-HAR | Waist | 6 | |||||||||||
| [25] | 2021 | √ | HAR on microcontrollers | A, G, M | PAMAP2 | Hand, chest, ankle | 12 | √ | X | X | ||||
| [26] | 2021 | √ | √ | √ | Hybrid deep-learning-based approach | A, G | UCI-HAR | Waist | 6 | √ | X | X | ||
| A | WISDM | |||||||||||||
| [27] | 2021 | √ | Feature-fusion-based approach | A, G | SG | 6 | √ | X | X | |||||
| UCI-HAR | Waist | |||||||||||||
| [28] | 2021 | √ | √ | √ | Attention-based mechanism | A, G | HHAR | Hand, chest, ankle | 6 | √ | X | X | ||
| A, G, M | PAMAP2 | 12 | ||||||||||||
| A, G | USC-HAD | 12 | ||||||||||||
| [29] | 2021 | √ | √ | HAR using multimodal sensors | Multimodal | CMU-MMAC | Room | 11 | √ | X | X | |||
| [30] | 2021 | √ | Multimodal complex HAR | A, G, M | Lifelog | Pocket, wrist, chest | 9 | √ | X | X | ||||
| PAMAP2 | Wrist, arm, chest, | 18 | ||||||||||||
| [31] | 2021 | √ | √ | Hierarchical hybrid deep-learning-based approach | A, G | UCI-HAPT | Waist | 12 | √ | X | X | |||
| MobiAct | 11 | |||||||||||||
| [32] | 2021 | √ | Resource-constrained HAR | EMG sensors | Myo-TL | Elbow, wrist | 9 | √ | X | X | ||||
| Db5 | 18 | |||||||||||||
| Our Approach | √ | Human activity recognition on signal images (HARSI) | A | WISDM | 6 | √ | √ | |||||||
| Model | Learning Rate | Activation Function | Optimizer | Loss Function | Total Parameters | Total Trainable Parameters | Total Nontrainable Parameters | 
|---|---|---|---|---|---|---|---|
| HARSI-ResNet34 | 2 × 10−3 | ReLU (Rectified Linear Unit) | Adam (Adaptive Moment Estimation) | Cross Entropy | 21,815,104 | 547,456 | 21,267,648 | 
| HARSI-ResNet50 | 6 × 10−4 | 25,617,472 | 2,162,560 | 23,454,912 | |||
| HARSI-ResNet101 | 1 × 10−3 | 44,609,600 | 2,214,784 | 42,394,816 | |||
| HARSI-AlexNet | 2 × 10−3 | 2,736,960 | 267,264 | 2,469,696 | |||
| HARSI-DenseNet121 | 4 × 10−4 | 8,010,624 | 1,140,416 | 6,870,208 | |||
| HARSI-SqueezeNet_v1.0 | 2 × 10−3 | 1,265,856 | 530,432 | 735,424 | |||
| HARSI-SqueezeNet_v1.1 | 1 × 10−3 | 1,252,928 | 530,432 | 722,496 | |||
| HARSI-VGG16 | 1 × 10−3 | 15,253,568 | 538,880 | 14,714,688 | |||
| HARSI-VGG19 | 2 × 10−3 | 20,565,824 | 541,440 | 20,024,384 | 
| Model | Accuracy (%) | Precision | Recall | F-Measure | 
|---|---|---|---|---|
| HARSI-ResNet34 | 97.33 | 0.97433 | 0.97333 | 0.97326 | 
| HARSI-ResNet50 | 96.00 | 0.96054 | 0.96000 | 0.96006 | 
| HARSI-ResNet101 | 97.17 | 0.97194 | 0.97166 | 0.97161 | 
| HARSI-AlexNet | 89.17 | 0.89018 | 0.89166 | 0.89077 | 
| HARSI-DenseNet121 | 96.67 | 0.96695 | 0.96666 | 0.96671 | 
| HARSI-SqueezeNet_v1.0 | 89.67 | 0.90025 | 0.89666 | 0.89725 | 
| HARSI-SqueezeNet_v1.1 | 93.00 | 0.92915 | 0.93000 | 0.92937 | 
| HARSI-VGG16 | 96.83 | 0.96871 | 0.96833 | 0.96826 | 
| HARSI-VGG19 | 98.00 | 0.97999 | 0.98000 | 0.97999 | 
| Ref. | Year | Method | Accuracy (%) | 
|---|---|---|---|
| [12] | 2022 | Support Vector Machines | 87.40 | 
| Random Forest | 86.10 | ||
| [34] | 2021 | Decision Tree | 82.00 | 
| Logistic Regression | 68.00 | ||
| Multilayer Perceptron | 80.00 | ||
| Neural Networks | 94.00 | ||
| [35] | 2021 | Decision Tree | 89.76 | 
| Linear Discriminant Analysis | 86.64 | ||
| Gradients Boosting | 89.65 | ||
| K-Nearest Neighbors | 92.54 | ||
| Bagging | 92.48 | ||
| Random Forest | 92.71 | ||
| Linear Kernel SVM | 78.55 | ||
| RBF Kernel SVM | 89.07 | ||
| Polynomial Kernel SVM | 92.48 | ||
| [36] | 2021 | Random Forest | 79.38 | 
| K-Nearest Neighbors | 75.04 | ||
| Decision Tree | 77.60 | ||
| Gradient Boosting | 74.80 | ||
| [37] | 2020 | Random Forest | 83.35 | 
| Neural Networks | 77.02 | ||
| Decision Tree (J48) | 75.96 | ||
| Reduced-Error Pruning (REP) Tree | 74.64 | ||
| K-Nearest Neighbors | 72.08 | ||
| KStar | 71.84 | ||
| Naive Bayes | 63.89 | ||
| Support Vector Machines | 55.45 | ||
| [38] | 2020 | Random Forest | 92.78 | 
| Support Vector Machines | 91.39 | ||
| [39] | 2020 | Neural Networks | 89.10 | 
| Decision Tree | 87.45 | ||
| Support Vector Machines | 95.13 | ||
| Linear Support Vector Classifier | 86.20 | ||
| Logistic Regression | 81.10 | ||
| Random Forest | 82.10 | ||
| [40] | 2020 | Support Vector Machines | 82.00 | 
| [41] | 2020 | Multilayer Perceptron | 86.95 | 
| [42] | 2019 | K-Nearest Neighbors | 92.00 | 
| Support Vector Machines | 93.50 | ||
| Bagging | 93.80 | ||
| [43] | 2018 | Random Forest | 82.66 | 
| K-Nearest Neighbors | 66.19 | ||
| [44] | 2018 | Support Vector Machines | 82.27 | 
| [45] | 2018 | Naive Bayes | 80.12 | 
| Decision Tree | 81.02 | ||
| K-Nearest Neighbors | 77.58 | ||
| Support Vector Machines | 80.93 | ||
| [46] | 2017 | Naive Bayes Tree | 87.70 | 
| Multilayer Perceptron | 77.52 | ||
| DT + LR + MLP | 91.62 | ||
| NB Tree + MLP | 96.35 | ||
| [47] | 2016 | AdaBoost + J48 | 97.83 | 
| AdaBoost + REP Tree | 97.33 | ||
| AdaBoost + Random Tree | 95.69 | ||
| AdaBoost + Random Forest | 94.44 | ||
| AdaBoost + Hoeffding Tree | 87.84 | ||
| AdaBoost + Decision Stump | 57.31 | ||
| [48] | 2015 | Decision Tree (J48) | 86.08 | 
| Logistic Regression | 77.52 | ||
| Multilayer Perceptron | 88.81 | ||
| J48 + LR + MLP | 91.62 | ||
| [49] | 2015 | Decision Tree (J48) | 92.40 | 
| Logistic Regression | 84.30 | ||
| Multilayer Perceptron | 91.70 | ||
| J48 + LR + MLP | 93.00 | ||
| [50] | 2015 | Neural Networks with Dropout | 85.36 | 
| Random Forest | 83.46 | ||
| [33] | 2010 | Logistic Regression | 78.10 | 
| Decision Tree (J48) | 85.10 | ||
| Multilayer Perceptron | 91.70 | ||
| Average | 84.28 | ||
| Our Approach | Human Activity Recognition on Signal Images (HARSI) | 98.00 | |
| Ref. | Year | Method | Accuracy(%) | 
|---|---|---|---|
| [12] | 2022 | CNN—Transfer Learning | 90.40 | 
| Convolutional Neural Networks | 88.20 | ||
| [5] | 2021 | CNN + Long Short-Term Memory | 97.76 | 
| Long Short-Term Memory | 96.61 | ||
| Convolutional Neural Networks | 94.51 | ||
| [13] | 2021 | Deep Neural Networks | 93.00 | 
| [24] | 2021 | Vanilla RNN + LSTM + GRU | 97.13 | 
| [26] | 2021 | CNN + Random Forest | 97.77 | 
| Deep Neural Networks | 74.00 | ||
| Deep Neural Networks + LSTM | 81.00 | ||
| Deep Neural Networks + Gated Recurrent Unit (GRU) | 80.00 | ||
| Convolutional Neural Networks | 88.00 | ||
| Convolutional Neural Networks + LSTM | 94.00 | ||
| Convolutional Neural Networks + GRU | 82.00 | ||
| [34] | 2021 | Deep Neural Networks | 95.00 | 
| [51] | 2021 | Residual Network | 95.66 | 
| Convolutional Neural Networks | 92.19 | ||
| [52] | 2021 | Deep Convolutional Neural Networks | 91.25 | 
| [53] | 2021 | 1D Convolutional Neural Networks | 91.12 | 
| 1D CNN + Fuzzy Neural Network | 92.96 | ||
| [54] | 2021 | Ensemble of Autoencoders (EAE) | 82.00 | 
| KNN + Very Fast Decision Tree + Naive Bayes (EkVN) | 73.00 | ||
| [55] | 2021 | NOvelty discrete data stream for Human Activity Recognition (NOHAR) | 93.00 | 
| [56] | 2021 | Deep Convolutional Neural Networks Ensemble | 89.01 | 
| [57] | 2021 | Convolutional AutoEncoder (CAE) | 95.60 | 
| [58] | 2021 | Convolutional Neural Networks | 95.00 | 
| Long Short-Term Memory | 97.50 | ||
| [59] | 2020 | Convolutional Neural Networks | 93.25 | 
| [60] | 2020 | Deep Convolutional Neural Networks | 94.18 | 
| Region-based CNN | 93.68 | ||
| [61] | 2020 | CNN—DenseNet | 94.65 | 
| [62] | 2020 | Bidirectional Long Short-Term Memory | 94.10 | 
| [38] | 2020 | Genetic algorithm-based classifier | 95.37 | 
| [39] | 2020 | Convolutional Neural Networks | 83.98 | 
| Long Short-Term Memory | 95.45 | ||
| [63] | 2020 | LSTM–Convolutional Neural Networks | 95.75 | 
| [64] | 2020 | Lightweight Recurrent Neural Network—LSTM | 95.78 | 
| [65] | 2020 | Multihead Convolutional Attention | 95.40 | 
| [66] | 2020 | Two-Stage End-to-end CNN with data augmentation (TSE + CNN + Aug) | 95.70 | 
| [41] | 2020 | Gramian Angular Field + Multidilated Kernel Residual Network | 96.83 | 
| Long Short-Term Memory | 87.53 | ||
| 1D Convolutional Neural Network | 93.66 | ||
| [6] | 2020 | EnsemConvNet (CNN-Net + Encoded-Net + CNN-LSTM) | 97.20 | 
| [68] | 2020 | Convolutional Neural Networks | 97.51 | 
| [69] | 2020 | Convolutional Neural Networks | 94.11 | 
| Residual Network | 95.72 | ||
| Residual Network of Residual Network | 96.73 | ||
| [40] | 2020 | Convolutional Neural Networks | 81.70 | 
| Recurrent Convolutional Network (RCN) | 94.00 | ||
| Recurrent Convolutional Network + SVM | 91.50 | ||
| [70] | 2019 | U-Net | 96.40 | 
| Mask Region-based CNN (R-CNN) | 86.20 | ||
| SegNet: A Deep Convolutional Encoder-Decoder Arch. | 95.70 | ||
| Full Convolutional Network (FCN) | 87.90 | ||
| Deep Convolutional and LSTM | 94.80 | ||
| Long Short-Term Memory | 93.80 | ||
| Convolutional Neural Networks | 94.10 | ||
| [71] | 2019 | LSTM–Recurrent Neural Networks | 93.81 | 
| [42] | 2019 | Supervised Regularization-based Robust Subspace (SRRS) | 93.50 | 
| Robust Principal Component Analysis | 85.70 | ||
| Latent Low-Rank Representation (LLRR) | 91.90 | ||
| Joint Embedding Learning and Sparse Regression (JELSR) | 73.40 | ||
| Principal Component Analysis (PCA) | 92.30 | ||
| Linear Discriminant Analysis (LDA) | 71.50 | ||
| [72] | 2019 | Long Short-Term Memory | 97.00 | 
| [44] | 2018 | Convolutional Neural Networks | 91.97 | 
| [45] | 2018 | Multivariate Bag-Of-SFA-Symbols | 83.35 | 
| [73] | 2018 | Deep Autoencoder-Set Network | 94.90 | 
| [74] | 2018 | Long Short-Term Memory | 97.00 | 
| [43] | 2018 | Convolutional Neural Networks | 93.32 | 
| [75] | 2017 | Impersonal Smartphone-based Activity Recognition (ISAR) | 75.21 | 
| [76] | 2016 | Long Short-Term Memory | 92.10 | 
| [77] | 2015 | STream learning for mobile Activity Recognition (STAR) | 71.20 | 
| Average | 90.94 | ||
| OurApproach | Human Activity Recognition on Signal Images (HARSI) | 98.00 | |
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cengiz, A.B.; Birant, K.U.; Cengiz, M.; Birant, D.; Baysari, K. Improving the Performance and Explainability of Indoor Human Activity Recognition in the Internet of Things Environment. Symmetry 2022, 14, 2022. https://doi.org/10.3390/sym14102022
Cengiz AB, Birant KU, Cengiz M, Birant D, Baysari K. Improving the Performance and Explainability of Indoor Human Activity Recognition in the Internet of Things Environment. Symmetry. 2022; 14(10):2022. https://doi.org/10.3390/sym14102022
Chicago/Turabian StyleCengiz, Ayse Betul, Kokten Ulas Birant, Mehmet Cengiz, Derya Birant, and Kemal Baysari. 2022. "Improving the Performance and Explainability of Indoor Human Activity Recognition in the Internet of Things Environment" Symmetry 14, no. 10: 2022. https://doi.org/10.3390/sym14102022
APA StyleCengiz, A. B., Birant, K. U., Cengiz, M., Birant, D., & Baysari, K. (2022). Improving the Performance and Explainability of Indoor Human Activity Recognition in the Internet of Things Environment. Symmetry, 14(10), 2022. https://doi.org/10.3390/sym14102022
 
        



 
       