Robust Iris Segmentation with Deep CNNs for Detecting Fully or Nearly Closed Eyes in Non-Ideal Biometric Systems
Abstract
1. Introduction
Research Gap
- Efficient detection of fully or nearly closed eyes in covert setups.
- Real-time robust iris segmentation in the presence of noise.
- The overall efficiency of the resultant iris biometric system has been improved.
- Using the proposed DCCN model as a preprocessing step, contemporary iris segmentation and localization schemes can also qualify for real-time applications.
- The proposed approach equally works for both visible spectrum (VS) and near-infrared (NIR) images.
2. Proposed Algorithm
- ○
- DCNN model for fully/nearly closed eyes detection;
- ○
- Enhanced iris biometric system.
2.1. DCNN Model for Fully/Nearly Closed Eyes Detection
- ○
- Preprocessing;
- ○
- Dataset augmentation;
- ○
- Transfer learning.
2.1.1. Preprocessing
- Step-a:
- Color (RGB) requirements: All existing P-DCNN models (e.g., GoogleNet, SqueezeNet, and VGG19) work only with RGB (i.e., visible spectrum) images. The proposed scheme in this step checks the RGB format of the input image, say ∅ (x, y), as follows:If ∅ (x, y) is not an RGB image, then it copies gray-level values of ∅ (x, y) into red, green, and blue planes to construct a new RGB equivalent image C (x, y), such asOtherwise,As shown in Figure 4, Equation (1) does not bring any change in the visual perception of a gray-level image ∅ (x, y); it now has three planes like an RGB image. As C (x, y) has original gray-level values of ∅ (x, y) in its red, green, and blue planes, the pretrained DCNN models accept C (x, y) as a color input image. Otherwise, these models will make an image dimensionality error.
- Step-b:
- Resolution requirements: As P-DCNN models accept images with a specific resolution, this step adjusts the resolution of C (x, y) accordingly. To begin with, the first proposed scheme explores the first layer of the target P-DCNN model, which has information about image resolution and RGB format. For example, the first layer of the “GoogleNet” shows as the input image requirement. It means C (x, y) must have 224 rows, 224 columns, and 3 planes (i.e., RGB). Otherwise, it would not be entertained by the input layer.
2.1.2. Dataset Augmentation
- ○
- Horizontal Translation: The proposed algorithm generates new images by randomly translating each sample image horizontally to both left and right sides; the translation range is preset to twenty pixels.
- ○
- Vertical Translation: The proposed algorithm also generates new images by randomly translating each sample image vertically up and down; the translation range is preset to twenty pixels.
- ○
- Rotation: In this step, the proposed algorithm generates new images by randomly rotating each sample image in both clockwise and anticlockwise directions; the range is preset to
2.1.3. Transfer Learning: Training P-DCCN Models on New Images
- ○
- Loading a P-DCNN model;
- ○
- Preparing base model (BM);
- ○
- Freezing BM initial layers;
- ○
- Training BM on the new dataset;
- ○
- Fine-tuning BM.
- Step-a:
- Loading P-DCNN Model: As pretrained P-DCNN models are the saved networks, this step loads one such model (e.g., SqueezeNet) along with its weights. All P-DCNN models have a layered architecture; therefore, their complexity depends on the number of hidden layers, overall architecture, trainable parameters, etc.
- Step-b:
- Preparing Base Model (BM): As P-DCNN models are already trained on images of 1000 classes, the loaded P-DCNN model has more neurons in the final output layer than required for the current case. Due to this reason, the author replaced the output layer (called classification layer or fullyConnectedLayer) with a new layer having only two classes, i.e., FullyAlmostClosedIrises and OpenIrises. The first one will classify fully or nearly closed eyes, whereas the second one will classify fully or almost open eyes. Additionally, the author also removed the final convolutional layer (known as convolution2dLayer) and replaced it with a new one. Convolution layers extract the vital low- and high-level features from target images during training and update their weights accordingly. The resultant P-DCNN model is now called the base model (BM).
- Step-c:
- Freezing BM Initial Layers: Initial layers of BM have knowledge regarding generic type features such as edges, color, and gradients. On the other hand, its final layers have knowledge of features specific to the target task. For this reason, it is wise to freeze starting layers to avoid the headache of retraining. Unfreezing initial layers will let them update their weight again (i.e., relearning) on the training dataset; notably, this act will not be different from training target BM from scratch, resulting in loss of technical resources, time, etc. For this reason, it is better to either freeze these layers or reduce their learning rate to a minimum level as compared with the last learnable layers.
- Step-d:
- Training BM: In this step, BM is given training on a newly developed dataset (see Section 3.2), including 600 eye images. Here, half of the images have closed eyes, and the remaining half have open eyes. This dataset is divided into training (70%) and validation (30%) sets. Though this dataset is relatively small, due to transfer learning and an augmentation scheme, it is sufficient to get the best results and avoid unnecessary overfitting. Before training begins, it is necessary to initialize the hyperparameters of the new fullyConnectedLayer and convolution2dLayer. Additionally, a choice for a Solver (i.e., optimizer) also needs to be selected. Table 2 shows a summary of hyperparameter initialization.With three choices for each hyperparameter, the total number of permutations for six hyperparameters is . It implies that a total of 729 clones of BM would be prepared after training, for each of the 19 P-DCNNs. That is, for 19 P-DCNN models, the total number of expected clones is . In this step, each hyperparameter is initialized to the first choice from its respective domain, and the training process is initiated.
- Step-e:
- BM Fine-tuning: For each trial, values of hyperparameters are recorded in conjunction with the mini-batch accuracy, validation accuracy, time elapsed, mini-batch loss, validation loss, etc. To automate this task, the authors utilized the MATLAB Experiment Manager (R2026), which is capable of executing trials in parallel or just sequentially. It also records all necessary training plots and confusion matrices. Fine-tuning of BM is essential while finding the best choice of a well-trained DCNN model for the current application. Figure 8 endorses this claim, because the validation accuracy of BM jumps from 50% to 98.89% by simply changing the value of InitialLearnRate from 0.1 to 0.001, respectively.
2.2. Proposed Enhanced Iris Biometric System
- ○
- Image acquisition setup,
- ○
- Iris segmentation,
- ○
- Features extraction, and
- ○
- Matching and recognition.
Proposed Iris Segmentation
- Step 1:
- The author transforms the input eye image to its grayscale format if it is already an RGB (color) image. The resultant image is , see Figure 11b. However, to reduce computational cost, he further resizes (decimates) the resultant image by retaining only 60% of the original pixels. This process effectively discards 40% of the alternate rows and columns. Notably, this act lowers the data volume and significantly accelerates the subsequent processing steps, particularly CHT, which is computationally an intensive algorithm. The process of decimated image can be expressed as follows:where is the scaling factor, and denote the dimensions of the resized image, and are the coordinates in the decimated image. No doubt, this operation ensures a uniform reduction in both row and column dimensions. After resizing, the author next applies a median filter of size to the resultant grayscale image to suppress noise and small artifacts such as eyelashes, reflections, and sensor noise. The median filtering operation is represented as follows:where is the neighborhood window centered at pixel . The resultant image is shown in Figure 11c. The author chose a median filter to clear out noise like eyelashes, hair, and light reflections. This filter is the best fit because it keeps the inner and outer iris boundaries sharp. Unlike a Gaussian filter, it does not blur edges. This keeps the limbic boundary clear for exact localization. It offers the perfect balance between removing noise and keeping the eye’s true shape. This makes the system much more reliable in messy or noisy images.Now, the author computes the global gray-level statistics of to estimate candidate regions corresponding to the pupil and the iris. These statistical parameters include the lower saturated limit , the upper saturated limit , and the average gray-level intensity . The lower and upper saturated limits offer robust approximations of the darkest and brightest regions in the input eye image, respectively, while the average intensity characterizes the overall gray-level distribution.
- ○
- : It represents gray-level intensity below which the bottom 5% of pixels lie. It basically refers to dark image regions, such as the pupil, eyelashes, eyebrows, or hair. It is expressed mathematically as follows:
- ○
- : It represents the intensity above which the top 5% of pixels lie. This parameter generally corresponds to bright regions such as the sclera or specular reflections. It is expressed mathematically as follows:
- ○
- : It represents the average gray-level intensity across the entire image, serving as a reference for intermediate gray regions, such as skin or iris regions. It is expressed mathematically as follows:where and denote the dimensions of .These parameters greatly facilitate the segmentation process by offering reference thresholds for characterizing dark, intermediate, and bright regions, which is specifically useful for localizing pupil and iris in non-ideal imaging conditions.
- Step 2:
- The author now applies the Canny edge detector to to produce an edge map , which highlights main intensity transitions in the eye, specifically at boundaries between pupil and iris and vice versa. These edges are crucial because the next stage, CHT, strongly relies on them while locating circular iris boundaries accurately. This process can be represented as follows:
- ○
- Smooth with a Gaussian filter to decrease noise.
- ○
- Compute intensity gradients to detect areas of instant change.
- ○
- Apply non-maximum suppression to thin the edges.
- ○
- Apply a double-thresholding scheme to keep only the strongest edges.
- Step 3:
- In this step, the author applies CHT to to detect circular contours representing the iris inner and outer boundaries. First, the candidate circles corresponding to the outer iris boundary are marked. Next, for each candidate, a circular region is extracted from the grayscale image and its average intensity is computed. If , the circle is considered a valid iris boundary; otherwise, the next candidate is accessed. The circle’s center coordinates and radius are recorded. This circular boundary can be expressed as follows:CHT itself operates using an accumulator function that is defined as follows:where counts the number of edge-pixels (x, y) supporting a circular contour with center at (a, b) and having radius , and symbol indicates the Dirac delta function. Generally, peaks in correspond to the target circular boundary.
- Step 4:
- Noise elements, including eyelids, eyelashes, and specular reflections, can be detected and marked at the recognition stage using masking techniques and morphological filtering [27].
3. Results and Discussion
3.1. Experimental Setup
- ○
- Operating system: Windows 10 Education.
- ○
- Simulation tool: MATLAB Version (R2020).
- ○
- Processor type: Intel(R) Core (TM) i7-8650U CPU 1.90 GHz 2.11 GHz.
- ○
- Installed RAM: 16.00 GB.
- ○
- Dataset: the main dataset has two folders, each representing the labeled data.
- ○
- OpenEyes: it has 300 fully/almost closed eye images.
- ○
- FullyAlmostClosedEyes: it has 300 fully/partially open eye images.
- ○
- P-DCNN models: GoogleNet, SqueezeNet, VGG16, VGG19, MobileNet-v2, ResNet18, ResNet50, ResNet101, Inception-v3, InceptionResNet-v2, AlexNet, DenseNet201, Xception, ShuffleNet, NASNet-Large, NASNet-Mobile, DarkNet19, DarkNet53, and EfficientNet-b0; in experimentation, these P-DCNN models are numbered from 1 to 19, respectively.
- ○
3.2. Preparing Target Dataset: Fully and Nearly Closed/Open Eyes
3.2.1. OpenEyes: Open Irises
3.2.2. FullyAlmostClosedEyes
3.3. DCNN Training and Validation Accuracy
3.4. Comparative Analysis: Proposed DCNN Model
3.5. Comparative Analysis: Proposed Segmentation Algorithm
4. Conclusions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| P-DCNN | Pretrained Deep Convolutional Neural Network. |
| NIR | Near infrared. |
| VS | Visible spectrum. |
| CHT/EHT | Circular/Elliptical Hough transform. |
| ACM | Active contours models. |
| IDO | Integro-differential operator. |
| ML | Machine learning. |
| DCNN | Deep convolutional neural network. |
| CNN | Convolutional neural network. |
| MTCNN | Multi-task cascaded CNN. |
| ECODN | Eye closed/open detection network. |
| MMU | Malaysia Multimedia University. |
| IITD | Indian Institute of Technology Delhi. |
| CASIA | Chinese Academy of Sciences’ Institute of Automation. |
| UBI | University of Beira Interior. |
| BM | Base model. |
References
- Rajaraman, S.; Antani, S.; Thoma, G.R. Deep Learning for Iris Recognition: A Survey and Analysis. Image Vis. Comput. 2024, 130, 104680. [Google Scholar]
- Chen, Y.; Wang, W.; Zeng, Z.; Wang, Y. An Adaptive CNNs Technology for Robust Iris Segmentation. IEEE Access 2019, 7, 64517–64532. [Google Scholar] [CrossRef]
- Gautam, G.; Mukhopadhyay, S. Challenges, taxonomy and techniques of iris localization: A survey. Digit. Signal Process. 2020, 107, 102852. [Google Scholar] [CrossRef]
- Wang, C.; Muhammad, J.; Wang, Y.; He, Z.; Sun, Z. Towards Complete and Accurate Iris Segmentation Using Deep Multi-Task Attention Network for Non-Cooperative Iris Recognition. IEEE Trans. Inf. Forensics Secur. 2020, 15, 2944–2959. [Google Scholar] [CrossRef]
- Khaki, A.; Aghagolzadeh, A.; Cami, B.R. ISUR: Iris Segmentation based on UNet and ResNet. In Proceedings of the 2021 11th International Conference on Computer Engineering and Knowledge (ICCKE), Mashhad, Iran, 28–29 October 2021; pp. 1–7. [Google Scholar]
- Shanto, S.H.; Ali, M.N.; Ahsan, S.M.M. An Advanced CNN Based Iris Recognition and Segmentation for Visible Spectrum Images. In Proceedings of the 2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE), Gazipur, Bangladesh, 24–26 February 2022; pp. 1–5. [Google Scholar]
- Agarwal, A.; Noore, A.; Vatsa, M.; Singh, R. Generalized Contact Lens Iris Presentation Attack Detection. IEEE Trans. Biom. Behav. Identity Sci. 2022, 4, 373–385. [Google Scholar] [CrossRef]
- Le-Tien, T.; Phan-Xuan, H.; Nguyen-Duy, P.; Le-Ba, L. Iris-based Biometric Recognition using Modified Convolutional Neural Network. In Proceedings of the 2018 International Conference on Advanced Technologies for Communications (ATC), Ho Chi Minh City, Vietnam, 18–20 October 2018; pp. 184–188. [Google Scholar]
- Rafik, H.D.; Boubaker, M. A Multi Biometric System Based on the Right Iris and the Left Iris Using the Combination of Convolutional Neural Networks. In Proceedings of the 2020 Fourth International Conference on Intelligent Computing in Data Sciences (ICDS), Fez, Morocco, 21–23 October 2020; pp. 1–10. [Google Scholar]
- Saraf, T.O.Q.; Fuad, N.; Taujuddin, N.S.A.M. Feature Encoding and Selection for Iris Recognition Based on Variable Length Black Hole Optimization. Computers 2022, 11, 140. [Google Scholar] [CrossRef]
- Galterio, M.G.; Shavit, S.A.; Hayajneh, T. A Review of Facial Biometrics Security for Smart Devices. Computers 2018, 7, 37. [Google Scholar] [CrossRef]
- Mayer, P.; Zou, Y.; Lowens, B.M.; Dyer, H.A.; Le, K.; Schaub, F.; Aviv, A.J. Awareness, Intention, (In)Action: Individuals’ Reactions to Data Breaches. ACM Trans. Comput. Hum. Interact. 2023, 30, 77. [Google Scholar] [CrossRef]
- Biometric_SmartCity. Available online: https://www.smartcity.press/cybersecurity-with-biometric-technology (accessed on 29 May 2023).
- Fu, K.; Zhao, Q.; Gu, I.; Yang, J. Deepside: A General Deep Framework for Salient Object Detection. Neurocomputing 2019, 356, 69–82. [Google Scholar] [CrossRef]
- Fu, K.; Fan, D.-P.; Ji, G.-P.; Zhao, Q. JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 3049–3059. [Google Scholar]
- Fan, D.-P.; Liu, J.-J.; Gao, S.; Hou, Q.; Borji, A.; Cheng, M.-M. Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018. [Google Scholar]
- Wang, T.; Piao, Y.; Lu, H.; Li, X.; Zhang, L. Deep Learning for Light Field Saliency Detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 8837–8847. [Google Scholar]
- Zhang, L.; Zhang, J.; Lin, Z.; Lu, H.; He, Y. CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 6017–6026. [Google Scholar]
- Zhang, J.; Fan, D.-P.; Dai, Y.; Anwar, S.; Saleh, F.; Zhang, T.; Barnes, N. UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders; IEEE: New York, NY, USA, 2020; pp. 8579–8588. [Google Scholar]
- Daugman, J.G. High confidence visual recognition of persons by a test of statistical independence. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 15, 1148–1161. [Google Scholar] [CrossRef]
- Wildes, R.P. Iris Recognition: An Emerging Biometric Technology. Proc. IEEE 1997, 85, 1348–1363. [Google Scholar] [CrossRef]
- UBIRIS_Database. Available online: https://iris.di.ubi.pt/ (accessed on 29 May 2023).
- Nsaif, A.K.; Ali, S.H.M.; Nseaf, A.K.; Jassim, K.N.; Al-Qaraghuli, A.; Sulaiman, R. Robust and Swift Iris Recognition at distance based on novel pupil segmentation. J. King Saud Univ. Comput. Inf. Sci. 2022, 34, 9184–9206. [Google Scholar] [CrossRef]
- Donida Labati, R.; Muñoz, E.; Piuri, V.; Ross, A.; Scotti, F. Non-ideal iris segmentation using Polar Spline RANSAC and illumination compensation. Comput. Vis. Image Underst. 2019, 188, 102787. [Google Scholar] [CrossRef]
- Nguyen, K.; Fookes, C.; Jillela, R.; Sridharan, S.; Ross, A. Long range iris recognition: A survey. Pattern Recognit. 2017, 72, 123–143. [Google Scholar] [CrossRef]
- Jan, F. Development and Analysis of Robust Iris Segmentation Algorithms for Non Ideal Iris Recognition System. Ph.D. Thesis, COMSATS Univeristy Islamabad, Islamabad, Pakistan, 2014. [Google Scholar]
- Jan, F.; Ahmed, M.I.B.; Min-Allah, N. Databases for Iris Biometric Systems: A Survey. SN Comput. Sci. 2020, 1, 324. [Google Scholar] [CrossRef]
- Kumari, P.; Seeja, K.R. Periocular biometrics: A survey. J. King Saud Univ. Comput. Inf. Sci. 2019, 34, 1086–1097. [Google Scholar] [CrossRef]
- Daugman, J. How Iris Recognition Works. IEEE Trans. Circuits Syst. Video Technol. 2004, 14, 21–30. [Google Scholar] [CrossRef]
- Kim, K.W.; Hong, H.G.; Nam, G.P.; Park, K.R. A Study of Deep CNN-Based Classification of Open and Closed Eyes Using a Visible Light Camera Sensor. Sensors 2017, 17, 1534. [Google Scholar] [CrossRef] [PubMed]
- Al-Shakarchy, N.D.; Israa Hadi, A. Open and Closed Eyes Classification in Different Lighting Conditions Using New Convolution Neural Networks Architecture. J. Theor. Appl. Inf. Technol. 2019, 97, 1970–1979. [Google Scholar]
- Jian, L.; Li, Z.; Yang, X.; Wu, W.; Ahmad, A.; Jeon, G. Combining Unmanned Aerial Vehicles With Artificial-Intelligence Technology for Traffic-Congestion Recognition: Electronic Eyes in the Skies to Spot Clogged Roads. IEEE Consum. Electron. Mag. 2019, 8, 81–86. [Google Scholar] [CrossRef]
- Yiu, Y.-H.; Aboulatta, M.; Raiser, T.; Ophey, L.; Flanagin, V.L.; zu Eulenburg, P.; Ahmadi, S.-A. DeepVOG: Open-source pupil segmentation and gaze estimation in neuroscience using deep learning. J. Neurosci. Methods 2019, 324, 108301–108307. [Google Scholar] [CrossRef] [PubMed]
- Le, A.D.; Nguyen, H.T.; Nakagawa, M. An End-to-End Recognition System for Unconstrained Vietnamese Handwriting. SN Comput. Sci. 2019, 1, 7. [Google Scholar] [CrossRef]
- Hajjami, A.; Khalid, A.; Arsalane, Z. Iris Localisation and segmentation using Convolutional neural network. In Proceedings of the 2019 Third International Conference on Intelligent Computing in Data Sciences (ICDS), Marrakech, Morocco, 28–30 October 2019; pp. 1–6. [Google Scholar]
- Ribeiro, E.; Uhl, A.; Alonso-Fernandez, F. Iris super-resolution using CNNs: Is photo-realism important to iris recognition? IET Biom. 2019, 8, 69–78. [Google Scholar] [CrossRef]
- Al-Waisy, A.S.; Qahwaji, R.; Ipson, S.; Al-Fahdawi, S.; Nagem, T.A.M. A multi-biometric iris recognition system based on a deep learning approach. Pattern Anal. Appl. 2018, 21, 783–802. [Google Scholar] [CrossRef]
- Hollingsworth, K.; Bowyer, K.W.; Flynn, P.J. The Best Bits in an Iris Code. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 31, 964–973. [Google Scholar] [CrossRef]
- Bowyer, K.W.; Hollingsworth, K.; Flynn, P.J. Image Understanding for Iris Biometrics: A Survey. Comput. Vis. Image Underst. 2008, 110, 281–307. [Google Scholar] [CrossRef]
- ISO/IEC 19794-6; Information Technology—Biometric Data Interchange Formats—Part 6: Iris Image Data. International Organization for Standardization: Geneva, Switzerland, 2011.
- Proença, H.; Alexandre, L.A. Iris Recognition: Analysis of the Error Rates Regarding the Accuracy of the Segmentation Stage. Image Vis. Comput. 2010, 28, 202–213. [Google Scholar] [CrossRef]
- Proença, H.; Alexandre, L.A. UBIRIS: A Noisy Iris Image Database. In Proceedings of the International Conference on Image Analysis and Processing (ICIAP), Cagliari, Italy, 6–8 September 2005. [Google Scholar]
- Li, Y.-H.; Huang, P.-J.; Juan, Y. An Efficient and Robust Iris Segmentation Algorithm Using Deep Learning. Mob. Inf. Syst. 2019, 2019, 4568929. [Google Scholar] [CrossRef]
- ImageNet. Available online: https://www.image-net.org/ (accessed on 29 May 2023).
- Fan, Y.; Li, H.; Bao, Y.; Xu, Y. Cycle-consistency-constrained few-shot learning framework for universal multi-type structural damage segmentation. Struct. Health Monit. 2026, 25, 874–893. [Google Scholar] [CrossRef]
- Xu, Y.; Zhang, C.; Li, H. Transformer-based large vision model for universal structural damage segmentation. Autom. Constr. 2025, 176, 106256. [Google Scholar] [CrossRef]
- Tong, Y.; Zeng, Y.; Lu, Y.; Huang, Y.; Jin, Z.; Wang, Z.; Wang, Y.; Zang, X.; Chang, L.; Mu, W.; et al. Deep learning-enhanced microwell array biochip for rapid and precise quantification of Cryptococcus subtypes. View 2024, 5, 20240032. [Google Scholar] [CrossRef]
- Tian, X.; Jiang, Y.; Zhu, S.; Liu, X.; Anwaier, A.; Ye, S.; Chang, K.; Qu, Y.; Gu, Y.J.; Zhang, H.; et al. A multimodal deep learning framework for predicting sunitinib response in advanced clear cell renal cell carcinoma. View 2025, 7, 20250157. [Google Scholar] [CrossRef]
- Mathworks. Available online: http://www.mathworks.com/ (accessed on 29 May 2023).
- Min-Allah, N.; Qureshi, M.B.; Jan, F.; Alrashed, S.; Taheri, J. Deployment of real-time systems in the cloud environment. J. Supercomput. 2020, 77, 2069–2090. [Google Scholar] [CrossRef]
- Jan, F.; Min-Ullah, N. An effective iris segmentation scheme for noisy images. Biocybern. Biomed. Eng. 2020, 40, 1064–1080. [Google Scholar] [CrossRef]
- MMU_Iris_Database. Available online: https://www.kaggle.com/datasets/naureenmohammad/mmu-iris-dataset (accessed on 29 May 2023).
- IITD_Iris_Databases. Available online: https://www4.comp.polyu.edu.hk/~csajaykr/IITD/Database_Iris.htm (accessed on 29 May 2023).
- CASIA_Database. Available online: https://www.kaggle.com/swoyam2609/datasets (accessed on 14 April 2026).
- SGGSIE&T_iris_database. Available online: https://sggs.ac.in/home/page/electronics-and-telecommunication-engineeringl (accessed on 14 April 2026).
- CEW_datset. Closed Eyes in the Wild (CEW). Available online: http://parnec.nuaa.edu.cn/_upload/tpl/02/db/731/template731/pages/xtan/ClosedEyeDatabases.html (accessed on 29 May 2023).
- Jan, F.; Usman, I.; Agha, S. Iris localization in frontal eye images for less constrained iris recognition systems. Digit. Signal Process. 2012, 22, 971–986. [Google Scholar] [CrossRef]
- Jan, F.; Alrashed, S.; Min-Allah, N. Iris segmentation for non-ideal Iris biometric systems. Multimed. Tools Appl. 2021, 83, 15223–15251. [Google Scholar] [CrossRef]
- Ibrahim, M.T.; Khan, T.M.; Khan, S.A.; Khan, M.A.; Guan, L. Iris localization using local histogram and other image statistics. Opt. Lasers Eng. 2012, 50, 645–654. [Google Scholar] [CrossRef]
- Khan, T.M.; Aurangzeb Khan, M.; Malik, S.A.; Khan, S.A.; Bashir, T.; Dar, A.H. Automatic localization of pupil using eccentricity and iris using gradient based method. Opt. Lasers Eng. 2011, 49, 177–187. [Google Scholar] [CrossRef]
- Daugman, J. New Methods in Iris Recognition. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2007, 37, 1167–1175. [Google Scholar] [CrossRef] [PubMed]


















| P-DCNN Models | Depth | Size (MB) | Trainable Parameters (Millions) | Image Dimensions (Pixels) |
|---|---|---|---|---|
| SqueezeNet | 18 | 5.2 | 1.24 | |
| GoogleNet | 22 | 27 | 7.0 | |
| Inception-v3 | 48 | 89 | 23.9 | |
| DenseNet201 | 201 | 77 | 20.0 | |
| MobilenetV2 | 53 | 13 | 3.5 | |
| ResNet18 | 18 | 44 | 11.7 | |
| ResNet50 | 50 | 96 | 25.6 | |
| ResNet101 | 101 | 167 | 44.6 | |
| Xception | 71 | 85 | 22.9 | |
| InceptionResNet-V2 | 164 | 209 | 55.9 | |
| ShuffleNet | 50 | 5.4 | 1.4 | |
| NASNetMobile | * | 20 | 5.3 | |
| NASNet-Large | * | 332 | 88.9 | |
| DarkNet19 | 19 | 78 | 20.8 | |
| DarkNet53 | 53 | 155 | 41.6 | |
| EfficientNet-b0 | 82 | 20 | 5.3 | |
| AlexNet | 8 | 227 | 61.0 | |
| VGG16 | 16 | 515 | 138 | |
| VGG19 | 19 | 535 | 144 |
| Hyperparameter | Value (or Range) | Comment |
|---|---|---|
| [35,39,44] | It is a subset of the training dataset, which is used to evaluate the gradient of the loss function and update the weights. | |
| [8,12,16] | An epoch means a full pass of the target data. | |
| [sgdm, rmsprop, adam] | Sgdm: it uses the Stochastic Gradient Descent with Momentum (SGDM) optimizer. rmsprop: it uses the RMSProp optimizer. Adam: it uses the Adam optimizer. | |
| [0.1, 0.001, 0.0001] | It is needed by Solver. | |
| [1,5,10] | Software will multiply this factor by the global learning rate while finding the learning rate for weights in this layer. | |
| [1,5,10] | Software will multiply this factor by the global learning rate while figuring out the learning rate for the biases in this layer. |
| Dataset | Illumination | Size (Pixels) | Images | Non-Ideal Factors | Setup |
|---|---|---|---|---|---|
| MMU V1.0 | NIR | 460 | Specular reflections, off-axis irises, and eyeglasses. | I | |
| MMU V2.0 | NIR | 995 | Off-axis irises, non-uniform illumination, blurring, reflections, contact lenses, eyelids, eyelashes, hair, and eyeglasses. | LC | |
| IITD V1.0 | NIR | 1120 | Contact lenses, reflections, focus, rotated iris, eyelids, eyelashes, and eyebrows. | LC | |
| CASIA-IrisV3-Interval | NIR | 2639 | Reflections, non-uniform illumination, partially open eye, blurring, natural and cosmetic eyelashes, eyeglasses, low-contrast, hair, eyelids, rotated-iris, non-circular boundaries, and lenses. | LC | |
| CASIA-IrisV3-Twins | 3118 | ||||
| CASIA-IrisV3-Lamp | 16,212 | ||||
| CASIA-IrisV4-Thousand | NIR | 20,000 | Synthesized images, defocus, reflections, non-uniform illumination, eyeglasses, eyelids, eyelashes, face images, tilted face images, and hair. | LC/UC | |
| CASIA-IrisV4-Syn | 10,000 | ||||
| CASIA-IrisV4-Distance | 2567 | ||||
| UBIRIS V1.0 | NIR/VS | ; | 1877 | Close eyes, low contrast, and reflections. | LC/UC |
| UBIRIS V2.0 | VS | 1102 | Low contrast, closed eye, reflections, off-axis and off-angle irises, and eyebrows. | LC/UC | |
| SGGSIE&T Iris Image Dataset | NIR | 1200 | Eyebrows, eyelids, off-axis and off-angle irises, and reflections. | LC | |
| CEW Dataset | VS | Non-uniform | 2423 face images | Closed/open eyes, eyeglasses, contact lenses, eyelids, eyelashes, off-axis and off-angle irises, blur, low resolution, and poor contrast. | UC |
| Hyperparameter | Value (Range) | Comment |
|---|---|---|
| [35,39] | A subset of the training dataset is used to evaluate the gradient of the loss function and update the weights. | |
| [8] | An epoch means a full pass of the target data. | |
| [sgdm] | Sgdm: Stochastic Gradient Descent with Momentum (SGDM) optimizer. | |
| [0.001, 0.0001] | It is needed by the Solver, i.e., sgdm. | |
| [10] | Software will multiply this factor by the global learning rate to find the learning rate for weights in this layer. | |
| [10] | Software multiplies this factor by the global learning rate to find out the learning rate for biases in this layer. |
| BM Model | TA (%) | VA (%) | (min) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| GoogleNet | 100.00 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 10.36 |
| SqueezeNet | 100.00 | 99.70 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 4.55 |
| MobileNetV2 | 100.00 | 99.65 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 16.78 |
| ResNet18 | 100.00 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 9.82 |
| ResNet50 | 100.00 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 25.53 |
| ResNet101 | 97.14 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 47.75 |
| Inception-v3 | 100.00 | 98.89 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 39.18 |
| InceptionResNet-V2 | 100.00 | 99.45 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 78.53 |
| AlexNet | 100.00 | 99.40 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 3.82 |
| VGG16 | 97.14 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 50.1 |
| VGG19 | 100.00 | 98.89 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 62.1 |
| EfficientNet-b0 | 100.00 | 99.40 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 23.92 |
| DenseNet201 | 100.00 | 98.89 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 54.4 |
| Xception | 100.00 | 98.89 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 91.47 |
| ShuffleNet | 100.00 | 98.89 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 8.26 |
| NASNet-Large | Exhibited memory outage in every trial | ||||||||
| NASNet-Mobile | 97.14 | 98.33 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 34.33 |
| DarkNet19 | 100.00 | 99.50 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 19.93 |
| DarkNet53 | 100.00 | 99.43 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 45.27 |
| BM Model | TA (%) | VA (%) | (min) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| GoogleNet | 97.14 | 95.56 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 9.91 |
| SqueezeNet | 100.00 | 98.33 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 4.82 |
| MobileNetV2 | 100.00 | 96.11 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 16.38 |
| ResNet18 | 100.00 | 96.67 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 9.70 |
| ResNet50 | 100.00 | 97.22 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 25.42 |
| ResNet101 | 97.14 | 96.11 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 42.45 |
| Inception-v3 | 100.00 | 94.44 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 37.37 |
| InceptionResNet-V2 | 97.14 | 93.89 | sgdm | 10 | 10 | 0.0001 | 8 | 35 | 73.88 |
| AlexNet | 97.50 | 98.33 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 4.99 |
| VGG16 | 50.00 | 50.00 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 68.12 |
| VGG19 | 100.00 | 96.11 | sgdm | 10 | 10 | 0.001 | 8 | 40 | 60.60 |
| EfficientNet-b0 | 97.50 | 92.78 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 23.83 |
| DenseNet201 | 100.00 | 98.33 | sgdm | 10 | 10 | 0.001 | 8 | 35 | 51.53 |
| Xception | 95.00 | 93.89 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 97.68 |
| ShuffleNet | 97.50 | 96.11 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 14.50 |
| NASNet-Large | Exhibited memory outage in every trial | ||||||||
| NASNet-Mobile | 92.00 | 92.78 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 55.53 |
| DarkNet19 | 100.00 | 98.89 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 32.45 |
| DarkNet53 | 100.00 | 98.89 | sgdm | 10 | 10 | 0.0001 | 8 | 40 | 47.34 |
| Ref | P-DCNN Model | Open Eyes (%) | Fully/Nearly Closed Eyes (%) | Average Time (ms) |
|---|---|---|---|---|
| [30] | ResNet50 | 98.4 | 97% | 250 |
| Proposed | SqueezeNet | 99.6 | 99% | 150 |
| Mode | Accuracy (%) | Time (s) |
|---|---|---|
| Without SqueezeNet | 48 | 50 |
| With SqueezeNet | 99.5 | 0.6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Jan, F. Robust Iris Segmentation with Deep CNNs for Detecting Fully or Nearly Closed Eyes in Non-Ideal Biometric Systems. Computers 2026, 15, 253. https://doi.org/10.3390/computers15040253
Jan F. Robust Iris Segmentation with Deep CNNs for Detecting Fully or Nearly Closed Eyes in Non-Ideal Biometric Systems. Computers. 2026; 15(4):253. https://doi.org/10.3390/computers15040253
Chicago/Turabian StyleJan, Farmanullah. 2026. "Robust Iris Segmentation with Deep CNNs for Detecting Fully or Nearly Closed Eyes in Non-Ideal Biometric Systems" Computers 15, no. 4: 253. https://doi.org/10.3390/computers15040253
APA StyleJan, F. (2026). Robust Iris Segmentation with Deep CNNs for Detecting Fully or Nearly Closed Eyes in Non-Ideal Biometric Systems. Computers, 15(4), 253. https://doi.org/10.3390/computers15040253

