How to Prevent Drivers before Their Sleepiness Using Deep Learning-Based Approach
Abstract
:1. Introduction
- First, the detection of facial landmarks and the 3D head position MediaPipe Face Mesh which uses approaches linked to machine learning and has shown its robustness in terms of its accuracy and speed compared to other approaches in the literature.
- Second, estimating the driver’s gaze based on the relationship between the iris and the eyes makes it possible to get a better idea of how tired or distracted the driver is.
- Third, the calculation of a normalized image of the iris is seen as a way to add more useful features to the MobileNetV3 model.
- Fourth, the concatenation of several features linked to the eyes and head position gives us more information, which makes it easier to find out if the driver is tired.
- Fifth, the choice of the deep neural networks by the LSTM in our system in order to take not only the current state of the driver but also the previous states.
- Sixth, we conduct our first experiment, which lets us pick the best hyperparameters to make our training model better.
- Seventh, a detection of five levels of driver states is realized in this work in order to alert the driver to their state of hypovigilance to avoid accidents.
2. Related Works
2.1. Eye Blinking Approaches
2.2. Approaches Based on Mouth and Yawning Detection
2.3. Approaches Based on Several Components of the Face
2.4. Discussion
3. Methodology
3.1. Head Pose Estimation and Facial Features Extraction
3.2. Eye Blinking Rate Detection
3.3. Iris Position According to Eye Corners
3.4. Iris and Its Surroundings Features Extraction
3.5. MobileNetV3 Transfer Learning for Iris Features Extraction
3.6. Deep LSTM Network for Multilevel Fatigue Classification
4. Experimentation and Analysis
4.1. Corpus and Experimental Data
4.2. Evaluation Metrics
4.3. Experiments for Face Detection
4.4. Hyperparameter Tuning, Results, and Discussion
5. Conclusions and Future Works
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Sigari, M.-H.; Fathy, M.; Soryani, M. A Driver Face Monitoring System for Fatigue and Distraction Detection. Int. J. Veh. Technol. 2013, 2013, 1–11. [Google Scholar] [CrossRef] [Green Version]
- Brandt, T.; Stemmer, R.; Rakotonirainy, A. Affordable visual driver monitoring system for fatigue and monotony. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC), Hague, The Netherlands, 10–13 October 2004; pp. 6451–6456. [Google Scholar]
- Guo, Y.; Xia, Y.; Wang, J.; Yu, H.; Chen, R.-C. Real-Time Facial Affective Computing on Mobile Devices. Sensors 2020, 20, 870. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Baddar, J.-T.; Man, R.Y. Mode Variational LSTM Robust to Unseen Modes of Variation: Application to Facial Expression Recognition. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), Kilauea Volcanic Site, Honolulu, HI, USA, 26 January 2019; pp. 3215–3223. [Google Scholar]
- Kansizoglou, I.; Bampis, L.; Gasteratos, A. An Active Learning Paradigm for Online Audio-Visual Emotion Recognition. IEEE Trans. Affect. Comput. 2022, 13, 756–768. [Google Scholar] [CrossRef]
- Picard, R.W. Affective Computing, 1st ed.; MIT Press: Cambridge, MA, USA, 1997; pp. 1–30. [Google Scholar]
- Deng, J.; Guo, N.; Zafeiriou, S. ArcFace: Additive Angular Margin Loss for Deep Face Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4685–4694. [Google Scholar]
- Kansizoglou, I.; Bampis, L.; Gasteratos, A. Deep Feature Space: A Geometrical Perspective. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 6823–6838. [Google Scholar] [CrossRef]
- Gochoo, M.; Rizwan, S.A.; Ghadi, Y.Y.; Jalal, A.; Kim, K. A Systematic Deep Learning Based Overhead Tracking and Counting System Using RGB-D Remote Cameras. Appl. Sci. 2021, 11, 5503. [Google Scholar] [CrossRef]
- Cyganek, B.; Gruszczyński, S. Hybrid computer vision system for drivers’ eye recognition and fatigue monitoring. Neurocomputing 2014, 126, 78–94. [Google Scholar] [CrossRef]
- Gou, C.; Wu, Y.; Wang, K.; Wang, F.; Ji, Q. A joint cascaded framework for simultaneous eye detection and eye state estimation. Pattern Recognit. 2017, 67, 23–31. [Google Scholar] [CrossRef] [Green Version]
- Ibrahim, L.F.; Abulkhair, M. Using Haar classifiers to detect driver fatigue and provide alerts. Multimed. Tools Appl. 2014, 71, 1857–1877. [Google Scholar] [CrossRef]
- Mandal, B.; Li, L.; Wang, G.; Lin, J. Towards detection of bus driver fatigue based on robust visual analysis of eye state. IEEE Trans. Intell. Transp. Syst. 2017, 18, 545–557. [Google Scholar] [CrossRef]
- Song, F.; Tan, X.; Liu, X.; Chen, S. Eyes closeness detection from still images with multi-scale histograms of principal oriented gradients. Pattern Recogn. 2014, 47, 2825–2838. [Google Scholar] [CrossRef]
- You, F.; Li, Y.H.; Huang, L.; Chen, K.; Zhang, R.H.; Xu, J.M. Monitoring drivers’ sleepy status at night based on machine vision. Multimed. Tools Appl. 2017, 76, 14869–14886. [Google Scholar] [CrossRef]
- Zhao, L.; Wang, Z.; Zhang, G.; Qi, Y.; Wang, X. Eye state recognition based on deep integrated neural network and transfer learning. Multimed. Tools Appl. 2018, 76, 19415–19438. [Google Scholar] [CrossRef]
- Anitha, J.; Mani, G.; Venkata Rao, K. Driver Drowsiness Detection Using Viola Jones Algorithm. In Smart Intelligent Computing and Applications; Suresh, S., Vikrant, B., Mohanty, J., Siba, K., Eds.; Springer: Singapore, 2020; pp. 583–592. [Google Scholar]
- Akrout, B.; Mahdi, B. Yawning detection by the analysis of variational descriptor for monitoring driver drowsiness. In Proceedings of the IEEE International Image Processing, Applications and Systems (IPAS), Hammamet, Tunisia, 5–7 November 20016; pp. 1–5. [Google Scholar]
- Omidyeganeh, M.; Shirmohammadi, S.; Abtahi, S.; Khurshid, A.; Farhan, M.; Scharcanski, J.; Hariri, B.; Laroche, D.; Martel, L. Yawning detection using embedded smart cameras. IEEE Trans. Instrum. Meas. 2016, 65, 570–582. [Google Scholar] [CrossRef]
- Zhang, W.; Su, J. Driver yawning detection based on long short term memory networks. In Proceedings of the IEEE Symposium Series on Computational Intelligence (SSCI), Honolulu, HI, USA, 27 November–1 December 2017; pp. 1–5. [Google Scholar]
- Zhang, W.; Murphey, Y.L.; Wang, T.; Xu, Q. Driver yawning detection based on deep convolutional neural learning and robust nose tracking. In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; pp. 1–8. [Google Scholar]
- Wang, R.; Tong, B.; Jin, L. Monitoring mouth movement for driver fatigue or distraction with one camera. In Proceedings of the International IEEE Conference on Intelligent Transportation Systems, Washington, WA, USA, 3–6 October 2004; pp. 314–319. [Google Scholar]
- Torres, R.; Ohashi, O.; Pessin, G. A Machine-Learning Approach to Distinguish Passengers and Drivers Reading While Driving. Sensors 2019, 19, 3174. [Google Scholar] [CrossRef] [Green Version]
- Benoit, A.; Bonnaud, L.; Caplier, A.; Ngo, P.; Lawson, L.; Trevisan, G.; Levacic, V.; Mancas, C.; Chanel, G. Multimodal focus attention and stress detection and feedback in an augmented driver simulator. Pers. Ubiquitous Comput. 2009, 13, 33–41. [Google Scholar] [CrossRef]
- Yu, C.; Qin, X.; Chen, Y.; Wang, J.; Fan, C. DrowsyDet: A Mobile Application for Real-time Driver Drowsiness Detection. In Proceedings of the IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation, Leicester, UK, 19–23 August 2019; pp. 425–432. [Google Scholar]
- Shih, T.H.; Hsu, C.T. Multistage Spatial-Temporal Network for Driver Drowsiness Detection. In Proceedings of the Computer Vision–ACCV 2016 Workshops (ACCV), Killarney, Ireland, 20–24 November 2016; pp. 146–153. [Google Scholar]
- Hossain, M.; Rahman, M.; Islam, M.; Arnisha, A.; Uddin, M.; Bikash, P. Automatic driver distraction detection using deep convolutional neural networks. Intell. Syst. Appl. 2022, 14, 200075. [Google Scholar] [CrossRef]
- Chen, L.; Xin, G.; Liu, Y.; Huang, J. Driver Fatigue Detection Based on Facial Key Points and LSTM. Secur. Commun. Netw. 2021, 2021, 5383573. [Google Scholar] [CrossRef]
- Akrout, B.; Mahdi, W. A novel approach for driver fatigue detection based on visual characteristics analysis. J. Ambient. Intell. Human. Comput. 2023, 14, 527–552. [Google Scholar] [CrossRef]
- Eraqi, H.; Abouelnaga, Y.; Saad, H.; Moustafa, N. Driver distraction identification with an ensemble of convolutional neural networks. J. Adv. Transp. 2019, 14, 1–12. [Google Scholar] [CrossRef]
- Arunasalam, M.; Yaakoba, N.; Amir, A.; Elshaikh, M.; Azahar, N.-F. Real-Time Drowsiness Detection System for Driver Monitoring. IOP Conf. Ser. Mater. Sci. Eng. 2020, 767, 1–12. [Google Scholar] [CrossRef]
- Kartynnik, Y.; Ablavatski, A.; Grishchenko, I.; Grundmann, M. Real-time Facial Surface Geometry from Monocular Video on Mobile GPUs. arXiv 2019, arXiv:1907.06724. [Google Scholar]
- Howard, A.; Sandler, M.; Chen, B.; Wang, W.; Chen, L.-C.; Tan, M.; Chu, G.; Vasudevan, V.; Zhu, Y.; Pang, R.; et al. Searching for MobileNetV3. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1314–1324. [Google Scholar]
- Bulat, A.; Tzimiropoulos, G. How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks). In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 1021–1030. [Google Scholar]
- Volker, B.; Thomas, V. A morphable model for the synthesis of 3D faces. In Proceedings of the Internaional Conference and Exhibition on Computer Graphics and Interactive Techniques (SIGGRAPH), New York, NY, USA, 1 July 1999; pp. 187–194. [Google Scholar]
- Artsiom, A.; Ablavatski, A.; Andrey, V.; Ivan, G.; Karthik, R.; Matsvei, Z. Real-time Pupil Tracking from Monocular Video for Digital Puppetry. arXiv 2020, arXiv:2006.11341, 1. [Google Scholar]
- Daugman, J.G. High Confidence Visual Recognition of Persons by a Test of Statistical Independence. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 14, 1148–1161. [Google Scholar] [CrossRef] [Green Version]
- Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv 2017, arXiv:1704.04861. [Google Scholar]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. arXiv 2018, arXiv:1801.04381. [Google Scholar]
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural. Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
- Akrout, B.; Mahdi, W. Spatio-temporal features for the automatic control of driver drowsiness state and lack of concentration. Mach. Vis. Appl. 2015, 26, 1–13. [Google Scholar] [CrossRef]
- Abtahi, S.; Omidyeganeh, M.; Shirmohammadi, S.; Hariri, B. Yawdd: A yawning detection dataset. In Proceedings of the 5th ACM Multimedia Systems Conference (MMSys), Singapore, 19 March 2014; pp. 24–28. [Google Scholar]
- Koelstra, S.; Muhl, C.; Soleymani, M.; Lee, J.-S.; Yazdani, A.; Ebrahimi, T.; Pun, T.; Nijholt, A.; Patras, I. DEAP:A Database for Emotion Analysis Using Physiological Signals. IEEE Trans. Affect. Comput. 2012, 3, 18–31. [Google Scholar] [CrossRef] [Green Version]
- Picot, A.; Charbonnier, S.; Caplier, A.; Vu, N.-S. Using Retina Modelling to Characterize Blinking: Comparison between EOG and Video Analysis. Mach. Vis. Appl. 2011, 23, 1195–1208. [Google Scholar] [CrossRef]
- Viola, P.; Jones, M. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Kauai, HI, USA, 8–14 December 2001; p. I. [Google Scholar]
- Boyko, N.; Basystiuk, O.; Shakhovska, N. Performance Evaluation and Comparison of Software for Face Recognition, Based on Dlib and Opencv Library. In Proceedings of the IEEE Second International Conference on Data Stream Mining & Processing (DSMP), Lviv, Ukraine, 21–25 August 2018; pp. 478–482. [Google Scholar]
- Xiao, W.; Liu, H.; Ma, Z.; Chen, W.; Sun, C.; Shi, B. Fatigue Driving Recognition Method Based on Multi-Scale Facial Landmark Detector. Electronics 2022, 11, 4103. [Google Scholar] [CrossRef]
- Shang, Y.; Yang, M.; Cui, J.; Cui, L.; Huang, Z.; Li, X. Driver Emotion and Fatigue State Detection Based on Time Series Fusion. Electronics 2022, 12, 26. [Google Scholar] [CrossRef]
Viola and Jone | Dlib Hog | MediaPipe Face Mesh | |
---|---|---|---|
DEP | 0.86 | 0.91 | 0.98 |
YawDD | 0.81 | 0.9 | 0.97 |
MiraclHB | 0.91 | 0.93 | 0.99 |
Average | 0.86 | 0.91 | 0.98 |
Hyperparameters | Values |
---|---|
Hidden-layer number | 2 |
Learning rate | 0.0005 |
First hidden-layer neurons | 1024 |
Second hidden-layer neurons | 512 |
Fatigue Level | Recall | Precision | F-Measure |
---|---|---|---|
Awake | 0.932 | 0.924 | 0.928 |
A little tired | 0.921 | 0.912 | 0.916 |
Tired | 0.925 | 0.923 | 0.924 |
Very tired | 0.942 | 0.952 | 0.947 |
Hypovigilance | 0.946 | 0.935 | 0.940 |
Average | 0.933 | 0.929 | 0.931 |
Fatigue Level | Recall | Precision | F-Measure |
---|---|---|---|
Awake | 0.987 | 0.988 | 0.987 |
A little tired | 0.921 | 0.926 | 0.923 |
Tired | 0.951 | 0.923 | 0.937 |
Very tired | 0.962 | 0.951 | 0.956 |
Hypovigilance | 0.989 | 0.994 | 0.991 |
Average | 0.962 | 0.956 | 0.959 |
Fatigue Level | Recall | Precision | F-Measure |
---|---|---|---|
Awake | 0.987 | 0.988 | 0.987 |
A little tired | 0.981 | 0.984 | 0.982 |
Tired | 0.979 | 0.982 | 0.980 |
Very tired | 0.984 | 0.981 | 0.982 |
Hypovigilance | 0.989 | 0.983 | 0.986 |
Average | 0.984 | 0.984 | 0.984 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Akrout, B.; Fakhfakh, S. How to Prevent Drivers before Their Sleepiness Using Deep Learning-Based Approach. Electronics 2023, 12, 965. https://doi.org/10.3390/electronics12040965
Akrout B, Fakhfakh S. How to Prevent Drivers before Their Sleepiness Using Deep Learning-Based Approach. Electronics. 2023; 12(4):965. https://doi.org/10.3390/electronics12040965
Chicago/Turabian StyleAkrout, Belhassen, and Sana Fakhfakh. 2023. "How to Prevent Drivers before Their Sleepiness Using Deep Learning-Based Approach" Electronics 12, no. 4: 965. https://doi.org/10.3390/electronics12040965
APA StyleAkrout, B., & Fakhfakh, S. (2023). How to Prevent Drivers before Their Sleepiness Using Deep Learning-Based Approach. Electronics, 12(4), 965. https://doi.org/10.3390/electronics12040965