Article

Pupil and Glint Detection Using Wearable Camera Sensor and Near-Infrared LED Array

School of Mechatronical Engineering, Beijing Institute of Technology, 5 South Zhongguancun Street, Haidian District, Beijing 100081, China
* Author to whom correspondence should be addressed.
Sensors 2015, 15(12), 30126-30141; https://doi.org/10.3390/s151229792
Submission received: 13 October 2015 / Revised: 17 November 2015 / Accepted: 27 November 2015 / Published: 2 December 2015
(This article belongs to the Section Physical Sensors)

Abstract

This paper proposes a novel pupil and glint detection method for a gaze tracking system using a wearable camera sensor and a near-infrared LED array. A novel circular ring rays location (CRRL) method is proposed for detecting pupil boundary points. First, improved Otsu optimal-threshold binarization, an opening-and-closing operation, and projections of the 3D gray-level histogram are used to estimate a rough pupil center and radius. Second, a circular ring area enclosing the pupil edge is determined from the rough pupil center and radius. Third, a series of rays is shot from the inner ring toward the outer ring to collect pupil boundary points, and interference points are eliminated by evaluating the gradient amplitude. Finally, an improved total least squares method is proposed to fit the collected pupil boundary points. The same improved total least squares is also used to solve a deformation of the Gaussian function for calculating the glint center. The experimental results show that the proposed method is more robust and accurate than conventional detection methods: even when interference factors such as glints and natural-light reflections lie on the pupil contour, the pupil boundary points and center are detected accurately. The proposed method contributes to enhancing the stability, accuracy and real-time quality of gaze tracking systems.

1. Introduction

Human beings acquire 80%~90% of outside information through their eyes, and visual perception information can be obtained through eye gaze tracking. With the continuing development of computer/machine vision technology, gaze tracking has been applied ever more widely in medicine [1], production testing [2], human-machine interaction [3,4], military aviation [5,6], etc.
As one of the traditional gaze tracking methods [7,8,9,10,11,12], the pupil center-corneal reflection (PCCR) technique has been steadily developed and improved in recent years [13,14,15,16,17,18]. Pupil and glint (corneal reflection) center detection plays a crucial role in PCCR-based gaze tracking. Images acquired by a CCD camera always contain interference factors such as eyelashes, eyelids, shadows and natural-light reflections, which cause false boundary points around the pupil contour. To ensure the accuracy of gaze estimation, a robust and accurate method of pupil and glint detection is essential.
Previous scholars have carried out a great deal of research on pupil and glint detection. Ebisawa [19] proposed a pupil detection technique using two alternating infrared light sources and the image difference of bright and dark eye images. The bright/dark eye image is acquired by switching on the light source coaxial/uncoaxial with the camera during the odd/even field alternately, which limits the sampling time. The glint position stays almost fixed; to detect it, the pupil brightness should be as low as possible. Although the image difference method is simple, the switching of light sources may affect its stability. To overcome the limitations of this technique, methods using a single eye image have since been proposed.
In [13], to obtain an accurate pupil center position, double ellipse fitting (rough and detailed) is performed to eliminate false boundary points. However, false boundary points around the pupil contour are difficult to eliminate, and the double ellipse fitting is time-consuming. The glint is detected by searching near the pupil, and its centroid is taken as the center position; the uncertain search time and result make the method unstable. Yoo et al. [20] acquire a rough pupil bound by iterative projection, and snakes are used to converge to the pupil boundary; elimination of false boundary points is not considered. The glint search region is limited by the rough pupil bound, and the pupil and glint center positions are finally determined by ellipse fitting. Gwon et al. [21] locate the approximate pupil area using the CED method and obtain a precise pupil center by calculating the geometric center of the black pixels. Before pupil detection, glints are erased using neighboring pixels in the horizontal direction; this erasure introduces errors into the pixels around the pupil contour and reduces the accuracy of pupil center location. To better locate the pupil boundary, Li et al. [22] developed a feature-based method: pupil contour candidates are detected along a series of rays shot from a best guess of the pupil center and marked with crosses, and RANSAC is applied to separate pupil contour points (inliers) from interference points (outliers). When interference factors such as glints and natural-light reflections lie on or around the pupil contour, some interference points become mixed with pupil contour points; in this case RANSAC cannot reliably separate them, and the accuracy of pupil center location suffers. Krishnamoorthi and Annapoorani [23] propose a boundary extraction technique to localize the pupil: an orthogonal-polynomials model is adopted to analyze the structure of the eye image, Hartley's statistical hypothesis test is employed in edge-map extraction, and a where-to-go approach with weight assignment finds the pupil boundary points. Although this algorithm locates pupil boundary points accurately, it is limited by its boundary assumption.
The remainder of this paper is organized as follows: Section 2 presents the proposed method in detail. Section 3 describes the experiments and shows the experimental results. Section 4 concludes the whole work.

2. Proposed Method

A novel and robust method of pupil and glint detection using a wearable camera sensor and near-infrared LED array for a gaze tracking system is proposed in this paper. Compared with the original Starburst, the proposed circular ring rays location (CRRL) method has higher stability, accuracy and real-time quality. It removes the uncertainty in the initial shooting point of the rays, and the second pass of shooting rays back toward the start point to collect more pupil boundary points is omitted. RANSAC is also omitted, because interference points can be eliminated effectively, and the pupil center can still be detected accurately when interference points lie on or around the pupil contour. The improved Otsu method is employed to acquire the eye's binary image. Part of the remaining interference factors (including eyelashes and eyelids) is eliminated by an opening-and-closing operation with structure elements of different sizes. Projections of the 3D gray-level histogram are used to estimate a rough pupil radius and center position, from which the circular ring area is determined. A series of equally spaced rays is shot from the inner ring to the outer ring, and pupil boundary points are detected from the gradient amplitude; the gradient amplitude at each pixel is also used to eliminate false boundary points. Spline interpolation is performed in the neighborhood of the boundary points to obtain subpixel-precise ones. An improved total least squares is developed to fit the ellipse, and the pupil center position is then calculated from the fitted elliptic equation. Because the gray levels of glint pixels are higher than anywhere else, the rough glint region is estimated by binarization with a fixed threshold. Since the glint's illumination intensity follows a Gaussian distribution, a deformation of the Gaussian function, solved by the improved total least squares, is used to calculate the glint center.

2.1. Proposed Gaze Tracking Device

In this study, we develop a wearable gaze tracking device composed of a helmet, a monitor, an array of four near-infrared light-emitting diodes (NIR LEDs) and a microspur camera, as shown in Figure 1. Because the imaging distance is limited to 3~5 cm, a microspur camera is adopted to acquire the eye image. The image resolution is 640 × 480 pixels (CCD sensor). The wavelength of the NIR LEDs is 850 nm and the power is less than 5 mW, so the experimental system brings no harm to human eyes [24].
Figure 1. Proposed gaze tracking device.

2.2. Pupil Detection

2.2.1. Binarization and Opening-and-Closing Operation

An improved Otsu method is employed to obtain the binary eye image in this paper. First proposed by Otsu in 1979, the Otsu method is based on adaptive threshold selection [25]. The original eye image is shown in Figure 2a, and its gray-level histogram in Figure 2b.
Figure 2. (a) Original eye image; (b) Gray-level histogram of eye image.
Assuming the number of pixels with gray level $i$ in the eye image is $n_i$, all gray levels are divided into three groups, as shown in Figure 2b:

$$G_0 = \{0, \ldots, T_1\}, \quad G_1 = \{T_1+1, \ldots, T_2\}, \quad G_2 = \{T_2+1, \ldots, 255\} \quad (1)$$
Group $G_0$ mainly contains gray levels of dark areas such as the pupil and eyelashes; group $G_1$ mainly contains gray levels of the iris and shadows; group $G_2$ mainly contains gray levels of the cornea and the surrounding skin. Let the occurrence probabilities of $G_0$, $G_1$, $G_2$ be $\omega_0$, $\omega_1$, $\omega_2$ and the corresponding mean gray levels be $h_0$, $h_1$, $h_2$:

$$\omega_0 = \sum_{i=0}^{T_1} p_i, \qquad h_0 = \frac{1}{\omega_0}\sum_{i=0}^{T_1} i\,p_i$$
$$\omega_1 = \sum_{i=T_1+1}^{T_2} p_i, \qquad h_1 = \frac{1}{\omega_1}\sum_{i=T_1+1}^{T_2} i\,p_i \quad (2)$$
$$\omega_2 = \sum_{i=T_2+1}^{255} p_i = 1 - \omega_0 - \omega_1, \qquad h_2 = \frac{1}{\omega_2}\sum_{i=T_2+1}^{255} i\,p_i = \frac{h - \omega_0 h_0 - \omega_1 h_1}{\omega_2}$$

Here $p_i = n_i / N$ is the occurrence probability of gray level $i$, $N = \sum_{i=0}^{255} n_i$ is the total number of pixels, and $h = \sum_{i=0}^{255} i\,p_i$ is the average gray level of the eye image.
The class variances are defined as

$$\sigma_0^2 = \sum_{i=0}^{T_1} \frac{(i-h_0)^2 p_i}{\omega_0}, \quad \sigma_1^2 = \sum_{i=T_1+1}^{T_2} \frac{(i-h_1)^2 p_i}{\omega_1}, \quad \sigma_2^2 = \sum_{i=T_2+1}^{255} \frac{(i-h_2)^2 p_i}{\omega_2} \quad (3)$$
The within-class variance is defined as

$$\sigma_W^2 = \omega_0\sigma_0^2 + \omega_1\sigma_1^2 + \omega_2\sigma_2^2 \quad (4)$$
We develop an improved, fast method for solving the optimal thresholds. According to Equations (3) and (4), the within-class variance is rewritten in integral form:

$$\sigma_W^2 = \int_0^{T_1} \frac{(i-h_0)^2 p_i}{\omega_0}\,di + \int_{T_1+1}^{T_2} \frac{(i-h_1)^2 p_i}{\omega_1}\,di + \int_{T_2+1}^{255} \frac{(i-h_2)^2 p_i}{\omega_2}\,di \quad (5)$$
Taking the partial derivatives of both sides of Equation (5) with respect to $T_1$ and $T_2$ and setting them to zero gives

$$\begin{cases} 2T_1 - h_0 - h_1 = 0 \\ 2T_2 - h_1 - h_2 = 0 \end{cases} \quad (6)$$
The threshold-selection criterion of the Otsu method is expanded as

$$g(T_1, T_2) = \operatorname*{Arg\,Max}_{0 < T_1 < T_2 < 255}\left\{\omega_0(h_0 - h)^2 + \omega_1(h_1 - h)^2 + \omega_2(h_2 - h)^2\right\} \quad (7)$$
According to Equations (6) and (7), the optimal thresholds can be solved. For each pixel in the original eye image, the mean gray level of its 3 × 3 neighborhood is calculated and substituted for its original gray level. The occurrence probabilities of the new gray levels are then used to solve for the optimal segmentation thresholds $T_1$ and $T_2$ according to Equations (6) and (7). Following the typical distribution of the eye image's gray-level histogram, $T_1$ is restricted to 0~50 and $T_2$ to $T_1$~150. The maximum value of $g(T_1, T_2)$ is computed according to Equation (7), and the corresponding $(T_1, T_2)$ is the optimal threshold pair.
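The two-threshold search described above is straightforward to implement. Below is a minimal Python sketch of Equations (2) and (7) under our reading of this section; the function name, the use of scipy's uniform filter for the 3 × 3 neighborhood mean, and the exhaustive search over the restricted ranges are our assumptions rather than the authors' code.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def improved_otsu(gray):
    """Two-threshold Otsu search over the restricted ranges of Section 2.2.1.

    A sketch, not the authors' implementation: `gray` is assumed uint8, and
    Eq. (7) is maximized by exhaustive search with T1 in 0~50, T2 in T1~150.
    """
    # Substitute each pixel with the mean gray level of its 3x3 neighborhood.
    smoothed = uniform_filter(gray.astype(np.float64), size=3).astype(np.uint8)

    # Occurrence probability p_i of each new gray level (Eq. (2)).
    hist = np.bincount(smoothed.ravel(), minlength=256)
    p = hist / hist.sum()
    i = np.arange(256)
    h = float((i * p).sum())                # average gray level of the image

    best, best_g = (1, 2), -np.inf
    for t1 in range(1, 51):                 # T1 limited to 0~50
        for t2 in range(t1 + 1, 151):       # T2 limited to T1~150
            w0 = p[:t1 + 1].sum()
            w1 = p[t1 + 1:t2 + 1].sum()
            w2 = 1.0 - w0 - w1
            if min(w0, w1, w2) <= 0:
                continue
            h0 = (i[:t1 + 1] * p[:t1 + 1]).sum() / w0
            h1 = (i[t1 + 1:t2 + 1] * p[t1 + 1:t2 + 1]).sum() / w1
            h2 = (h - w0 * h0 - w1 * h1) / w2
            # Between-class criterion g(T1, T2) of Eq. (7).
            g = w0 * (h0 - h) ** 2 + w1 * (h1 - h) ** 2 + w2 * (h2 - h) ** 2
            if g > best_g:
                best_g, best = g, (t1, t2)
    return best                             # optimal (T1, T2)
```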
The computational complexity of the new threshold-solving method is reduced. As shown in Table 1, the segmentation time of the improved method is less than that of the original Otsu method, which contributes to the real-time quality of eye gaze tracking.
Table 1. Segmentation time.

Method       Original Otsu    Improved Otsu
Time (ms)    32.4             17.1
To extract the pupil, threshold $T_1$ is used for binarization. The resulting binary eye image is shown in Figure 3.
Figure 3. Eye’s binary image with Otsu optimal threshold.
To completely eliminate the remaining interference points (mainly residual eyelashes and eyelids), an opening-and-closing operation with structure elements of different sizes is employed. According to the shape and size of the interference factors shown in Figure 3, a $0.3T_1 \times 0.3T_1$ square structure element is used in the opening operation and a $0.7T_1 \times 0.7T_1$ square structure element in the closing operation. The result is shown in Figure 4.
Figure 4. Result of opening-and-closing operation.
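A minimal OpenCV sketch of this clean-up step follows; inverse-thresholding at $T_1$ so that the pupil becomes foreground, and rounding the structure-element sizes to integers, are our assumptions.

```python
import cv2

def clean_pupil_mask(gray, t1):
    """Binarize with T1 and apply the opening-and-closing of Section 2.2.1."""
    # Pixels at or below T1 (pupil, eyelashes) become white foreground.
    _, mask = cv2.threshold(gray, t1, 255, cv2.THRESH_BINARY_INV)
    k_open = max(1, int(round(0.3 * t1)))    # 0.3*T1 square element (opening)
    k_close = max(1, int(round(0.7 * t1)))   # 0.7*T1 square element (closing)
    se_open = cv2.getStructuringElement(cv2.MORPH_RECT, (k_open, k_open))
    se_close = cv2.getStructuringElement(cv2.MORPH_RECT, (k_close, k_close))
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, se_open)    # drop eyelash specks
    mask = cv2.morphologyEx(mask, cv2.MORPH_CLOSE, se_close)  # fill glint holes
    return mask
```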

2.2.2. Rough Location of Pupil Area and Center

The pupil image acquired through the opening-and-closing operation presents an elliptical shape (irregular at glints and natural-light reflections). The 3D gray-level histogram of the opening-and-closing result is shown in Figure 5a, and its projections along the x and y axes in Figure 5b. The rough pupil area and center position are determined from the gray-level distribution in the projection image: the rough pupil area lies in a rectangular box of length $l_2$ and width $l_4$, the estimated pupil center is defined as $o_p = (l_1 + l_2/2,\ l_3 + l_4/2)$, and the estimated pupil radius as $r_p = (l_2 + l_4)/4$.
Figure 5. (a) 3D gray-level histogram of opening-and-closing operation result; (b) Rough location of pupil area and center.
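The projection step can be sketched as follows, assuming the cleaned binary mask from the previous step; reading $l_1$~$l_4$ from the first and last nonzero bins of the projections is our simplification of Figure 5b.

```python
import numpy as np

def rough_pupil(mask):
    """Estimate rough pupil center o_p and radius r_p from projections."""
    cols = (mask > 0).sum(axis=0)        # projection onto the x axis
    rows = (mask > 0).sum(axis=1)        # projection onto the y axis
    xs = np.flatnonzero(cols)
    ys = np.flatnonzero(rows)
    l1, l2 = xs[0], xs[-1] - xs[0] + 1   # left edge and length of the box
    l3, l4 = ys[0], ys[-1] - ys[0] + 1   # top edge and width of the box
    o_p = (l1 + l2 / 2.0, l3 + l4 / 2.0)
    r_p = (l2 + l4) / 4.0
    return o_p, r_p
```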

2.2.3. Collection of Pupil Boundary Points

A novel circular ring rays location (CRRL) method, a modification of Starburst, is proposed for pupil boundary point detection. The proposed method has the following advantages over the original Starburst method. First, a series of rays is shot from the inner circular ring to the outer circular ring instead of from a guessed point toward detected points. In the original Starburst, a second round of rays is needed to collect more pupil contour candidates; in our method, shooting rays once collects sufficient pupil boundary points to fit the ellipse, which shortens the collection stage. This style of shooting also saves calculation time, because the rays are shorter than those in the original Starburst. Second, the original Starburst uses RANSAC to separate pupil contour points (inliers) from interference points (outliers), which is time-consuming. We instead calculate the gradient amplitude at pixels neighboring the pupil boundary from the gray values of the pupil and iris regions in advance, set a gradient-amplitude threshold to detect pupil boundary points, and count the number of boundary points detected on each ray to eliminate interference points. The experimental results show that this elimination scheme is suitable and effective in the CRRL method. Third, cubic spline interpolation is applied in the neighborhood of the collected pupil boundary points to determine subpixel-precise boundary points, which enhances the accuracy of pupil center location.
Collection steps of pupil boundary points are presented in detail below:
Input: Gray-level eye image.
Output: Point set of pupil boundary points.
Step 1: Building the circular ring area. As shown in Figure 6, to build a circular ring area enclosing the pupil boundary, the estimated pupil center $o_p$ is taken as the center of the inner and outer rings (green lines), with radii $0.5 r_p$ and $1.5 r_p$ respectively.
Step 2: Location of pupil boundary points. Thirty-six rays (with an equal gap of $10°$) are shot from the inner ring to the outer ring. The gradient $\nabla f = [g_x\ g_y]^T$ is calculated at each pixel location $(x, y)$ along the shooting direction of each ray, and $M(x, y) = \sqrt{g_x^2 + g_y^2}$ is taken as the gradient amplitude. According to the variation range of the gradient amplitude near the pupil contour, a threshold range $[1.3\delta,\ 1.5\delta]$ with $\delta = T_2 - T_1$ is set in advance to select pupil boundary points. If the gradient amplitude at pixel $(x, y)$ along the shooting direction falls within this range, the pixel is recorded as a pupil boundary point. The number of pixels matching the threshold on each ray is counted.
Step 3: Elimination of interference points. When interference factors (glints and natural-light reflections) lie on or around the pupil contour, the number of pixels matching the threshold on a ray may exceed one. In this case, all boundary points recorded on that ray are discarded to avoid interference from glints and natural-light reflections.
Step 4: Subpixel-precise location of pupil boundary points. To enhance location accuracy, cubic spline interpolation [26] is applied in the neighborhood of the boundary points collected in Step 2 to determine subpixel-precise pupil boundary points.
Step 5: Marking of pupil boundary points. As shown in Figure 6, the determined pupil boundary points are marked with yellow "+". All determined candidates of pupil boundary points are collected into one point set for ellipse fitting; a code sketch of the collection loop is given after Figure 6.
Figure 6. Extraction result of pupil boundary points.
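The collection loop of Steps 1–3 can be sketched as follows. The ring radii, the 36 rays and the gradient threshold $[1.3\delta, 1.5\delta]$ follow the text; the 1-pixel sampling step, the use of numpy's gradient, and the grouping of consecutive in-band samples into a single crossing are our interpretation, and the cubic-spline subpixel refinement of Step 4 is omitted.

```python
import numpy as np

def collect_boundary_points(gray, o_p, r_p, t1, t2):
    """CRRL sketch: shoot rays across the ring and keep single-hit rays."""
    gy, gx = np.gradient(gray.astype(np.float64))
    mag = np.hypot(gx, gy)                        # gradient amplitude M(x, y)
    delta = t2 - t1
    lo, hi = 1.3 * delta, 1.5 * delta
    cx, cy = o_p

    points = []
    for theta in np.deg2rad(np.arange(0, 360, 10)):   # 36 rays, 10 deg apart
        hits, in_band_prev = [], False
        # Shoot from the inner ring (0.5 r_p) to the outer ring (1.5 r_p).
        for t in np.arange(0.5 * r_p, 1.5 * r_p, 1.0):
            x, y = cx + t * np.cos(theta), cy + t * np.sin(theta)
            xi, yi = int(round(x)), int(round(y))
            if not (0 <= yi < mag.shape[0] and 0 <= xi < mag.shape[1]):
                in_band_prev = False
                continue
            in_band = lo <= mag[yi, xi] <= hi
            if in_band and not in_band_prev:
                hits.append((x, y))               # one hit per crossing
            in_band_prev = in_band
        # Step 3: more than one hit means a glint or reflection sits on the
        # contour along this ray, so the whole ray is discarded.
        if len(hits) == 1:
            points.append(hits[0])
    return np.array(points)
```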

2.2.4. Ellipse Fitting

Total least squares (TLS) [27,28] was first proposed in 1980. In this paper, an improved total least squares is developed to fit the collected pupil boundary points. Compared with the least squares (LS) method, total least squares accounts for errors in both the independent and dependent variables: the matrix equation $Ax = b$ is solved by considering errors in both the data matrix $A$ and the observation vector $b$. To compensate for these errors, a perturbation vector $e$ perturbs the observation vector $b$ while, simultaneously, a perturbation matrix $E$ perturbs the data matrix $A$, with both $e$ and $E$ of minimum norm.
Assuming the elliptic equation of the eye pupil is $Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0$, the constraint $A + C = 1$ [29] is imposed to obtain higher fitting accuracy. The elliptic equation is then deformed as

$$B x_i y_i + C(y_i^2 - x_i^2) + D x_i + E y_i + F = -x_i^2 \quad (8)$$
where $i = 1, 2, \ldots, n$ and $n$ is the number of extracted pupil boundary points. Denoting the errors in pixel position $(x, y)$ by $(v_x, v_y)$, the ideal form of Equation (8) is

$$B(x_i y_i - v_{x_i y_i}) + C\left[(y_i^2 - x_i^2) - (v_{y_i}^2 - v_{x_i}^2)\right] + D(x_i - v_{x_i}) + E(y_i - v_{y_i}) + F = -(x_i^2 - v_{x_i}^2) \quad (9)$$
Equation (8) is written in matrix form as

$$M\tau = Y \quad (10)$$

where

$$M = \begin{bmatrix} x_1 y_1 & y_1^2 - x_1^2 & x_1 & y_1 & 1 \\ x_2 y_2 & y_2^2 - x_2^2 & x_2 & y_2 & 1 \\ \vdots & \vdots & \vdots & \vdots & \vdots \\ x_n y_n & y_n^2 - x_n^2 & x_n & y_n & 1 \end{bmatrix}, \quad \tau = [B\ C\ D\ E\ F]^T, \quad Y = [-x_1^2\ -x_2^2\ \cdots\ -x_n^2]^T$$

Let the augmented matrix be $H = [Y, M]$ and compute its singular values $\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_{\min}$ by the SVD method. According to the subspace interpretation of total least squares, the TLS solution of $M\tau = Y$ is deduced as
$$\tau_{TLS} = (M^T M - \sigma_{\min}^2 I)^{-1} M^T Y \quad (11)$$

where $\sigma_{\min}$ is the minimal singular value of the augmented matrix $H$; consequently, $\sigma_{\min}^2$ is the common variance of each component of the perturbation matrix $D = [e, E]$.
Because the column of constants in the coefficient matrix $M$ cannot be treated properly in the SVD, we propose an improved SVD solution. Setting $\alpha_{1i} = x_i y_i$, $\alpha_{2i} = y_i^2 - x_i^2$, $\alpha_{3i} = x_i$, $\alpha_{4i} = y_i$ and $\beta_i = -x_i^2$, the error equation of the ellipse can be defined as

$$v_i = B\alpha_{1i} + C\alpha_{2i} + D\alpha_{3i} + E\alpha_{4i} + F - \beta_i \quad (12)$$
Here we set

$$\bar{\alpha}_r = \frac{1}{n}\sum_{i=1}^{n} \alpha_{ri}\ (r = 1, 2, 3, 4), \qquad \bar{\beta} = \frac{1}{n}\sum_{i=1}^{n} \beta_i \quad (13)$$
Therefore, the coefficient $F$ is expressed as

$$F = \bar{\beta} - \bar{\alpha}_1 B - \bar{\alpha}_2 C - \bar{\alpha}_3 D - \bar{\alpha}_4 E = \bar{\beta} - \bar{\alpha}^T \tau \quad (14)$$

where $\bar{\alpha} = [\bar{\alpha}_1\ \bar{\alpha}_2\ \bar{\alpha}_3\ \bar{\alpha}_4]^T$ and $\tau = [B\ C\ D\ E]^T$.
Substituting Equation (14) into Equation (12) gives

$$\varepsilon = X\tau - Z \quad (15)$$

where

$$\varepsilon = [v_1\ v_2\ \cdots\ v_n]^T, \quad X = \begin{bmatrix} \alpha_{11} - \bar{\alpha}_1 & \alpha_{21} - \bar{\alpha}_2 & \alpha_{31} - \bar{\alpha}_3 & \alpha_{41} - \bar{\alpha}_4 \\ \alpha_{12} - \bar{\alpha}_1 & \alpha_{22} - \bar{\alpha}_2 & \alpha_{32} - \bar{\alpha}_3 & \alpha_{42} - \bar{\alpha}_4 \\ \vdots & \vdots & \vdots & \vdots \\ \alpha_{1n} - \bar{\alpha}_1 & \alpha_{2n} - \bar{\alpha}_2 & \alpha_{3n} - \bar{\alpha}_3 & \alpha_{4n} - \bar{\alpha}_4 \end{bmatrix}, \quad Z = [\beta_1 - \bar{\beta}\ \ \beta_2 - \bar{\beta}\ \cdots\ \beta_n - \bar{\beta}]^T$$
The total least squares solution of the matrix equation $\varepsilon = X\tau - Z$ is

$$\tau_{TLS} = (X^T X - \gamma_{\min}^2 I)^{-1} X^T Z \quad (16)$$
A new augmented matrix is defined as $L = [Z, X]$. To improve the fitting accuracy and stability of TLS, a novel and fast SVD method is used to obtain the singular values of $L$, whose SVD form is

$$L = U\Sigma V^T \quad (17)$$
The matrix $Q$ is defined as

$$Q = L^T L = (U\Sigma V^T)^T(U\Sigma V^T) = (V\Sigma U^T)(U\Sigma V^T) = V\Sigma^2 V^T \quad (18)$$
Equation (19) gives the product formed from two different columns $L_s$, $L_t$ of matrix $L$:

$$Q_{st} = [L_s, L_t]^T [L_s, L_t] \quad (19)$$

where $1 \leq s \leq 4$, $1 \leq t \leq 4$, $s \neq t$. The eigenvalue matrix $\Sigma_{st}$ is calculated as

$$\Sigma_{st} = \Delta_{st}^T Q_{st} \Delta_{st} \quad (20)$$
The columns of $L$ are then redefined as $[L_s, L_t]\Delta_{st}$, and this orthogonal transformation is applied to every pair of redefined columns of $L$ until the off-diagonal elements of $Q$ are eliminated. The eigenvalue matrix of $Q$ is thus solved as

$$\Sigma^2 = V^T Q V = V^T (L^T L) V = \begin{bmatrix} \gamma_1^2 & & & 0 \\ & \gamma_2^2 & & \\ & & \ddots & \\ 0 & & & \gamma_m^2 \end{bmatrix} \quad (21)$$
$\gamma_1, \gamma_2, \ldots, \gamma_m$ ($\gamma_1 \geq \gamma_2 \geq \cdots \geq \gamma_m$) are the singular values of matrix $L$. $\tau = [B\ C\ D\ E]^T$ is calculated according to Equation (16), and the pupil center is then obtained from

$$x_p = \frac{BE - 2CD}{4AC - B^2}, \qquad y_p = \frac{BD - 2AE}{4AC - B^2} \quad (22)$$

where $A = 1 - C$.
The sensitivity of the TLS problem depends on the ratio $r = (\sigma_p - \sigma_{p+1})/\tilde{\sigma}_p$, where $\sigma_p$, $\sigma_{p+1}$ and $\tilde{\sigma}_p$ are the least singular values of $X$ (or $M$), $L$ (or $H$) and $X_0$ (or $M_0$, the coefficient matrix in the corresponding LS problem), respectively. The larger the ratio $r$, the more accurate TLS is relative to LS. During the ellipse fitting of pupil boundary points, the ratios $r$ obtained with the SVD and the improved SVD are 0.82 and 0.94, respectively, so the improved TLS achieves higher accuracy than the original TLS. The improved TLS compensates for errors in pixel location, and the fitting result is closer to the ideal form of the elliptic equation (Equation (9)).
The result of ellipse fitting is shown in Figure 7. The red ellipse represents the fitted pupil contour, and the red marker indicates its center.
Figure 7. Ellipse fitting result.
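For concreteness, a minimal numpy sketch of the fit through Equations (15), (16), (14) and (22) follows. It obtains $\gamma_{\min}$ from numpy's general-purpose SVD rather than the authors' fast Jacobi-style routine of Equations (17)–(21), so it reproduces the TLS solution but not their speed-up; the function name is ours.

```python
import numpy as np

def fit_pupil_ellipse(points):
    """Improved-TLS ellipse fit sketch: returns the center and coefficients."""
    x, y = points[:, 0], points[:, 1]
    # Columns alpha_1..alpha_4 and observation beta, with A + C = 1 (Eq. (8)).
    cols = np.column_stack([x * y, y ** 2 - x ** 2, x, y])
    beta = -x ** 2
    Xc = cols - cols.mean(axis=0)                 # centered X of Eq. (15)
    Zc = beta - beta.mean()                       # centered Z of Eq. (15)

    L = np.column_stack([Zc, Xc])                 # augmented matrix L = [Z, X]
    gamma_min = np.linalg.svd(L, compute_uv=False)[-1]

    # tau = (X^T X - gamma_min^2 I)^-1 X^T Z      (Eq. (16))
    B, C, D, E = np.linalg.solve(
        Xc.T @ Xc - gamma_min ** 2 * np.eye(4), Xc.T @ Zc)
    F = beta.mean() - cols.mean(axis=0) @ np.array([B, C, D, E])   # Eq. (14)
    A = 1.0 - C

    den = 4 * A * C - B ** 2                      # ellipse center (Eq. (22))
    center = ((B * E - 2 * C * D) / den, (B * D - 2 * A * E) / den)
    return center, (A, B, C, D, E, F)
```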

2.3. Glint Detection

Because the glint region contains only a limited number of pixels and a halo exists around the glint contour, the proposed pupil detection method is not suitable for glints. Improved Gaussian fitting is therefore used to locate the glint center.

2.3.1. Rough Location of Glint Region

Because the illumination intensity of a glint is high and its gray levels are close to 255, a fixed threshold of 240 is adopted to binarize the eye image and extract the glints. A 2 × 2 square structure element is used in an opening-and-closing operation to filter the binary image. As shown in Figure 8, red rectangular boxes mark the rough glint regions.
Figure 8. Rough location of glint region.
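A minimal sketch of this rough localization step follows; the use of OpenCV connected components to obtain the bounding boxes is our choice, as the text only specifies the threshold and the 2 × 2 structure element.

```python
import cv2

def rough_glint_regions(gray):
    """Threshold at 240, filter with a 2x2 opening-and-closing, box the glints."""
    _, binary = cv2.threshold(gray, 240, 255, cv2.THRESH_BINARY)
    se = cv2.getStructuringElement(cv2.MORPH_RECT, (2, 2))
    binary = cv2.morphologyEx(binary, cv2.MORPH_OPEN, se)
    binary = cv2.morphologyEx(binary, cv2.MORPH_CLOSE, se)
    n, _, stats, _ = cv2.connectedComponentsWithStats(binary)
    # Each stats row is (x, y, w, h, area); label 0 is the background.
    return [tuple(stats[k, :4]) for k in range(1, n)]
```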

2.3.2. Gaussian Fitting

Figure 9a shows the enlarged glint region, and Figure 9b its 3D gray-level histogram. The glint's illumination intensity follows a Gaussian distribution [30].
Figure 9. (a) Enlarged glint region; (b) 3D gray-level histogram of enlarged glint.
The Gaussian function of the glint illumination intensity is defined as

$$I(x, y) = H \cdot e^{-\left[\frac{(x - x_g)^2}{2\sigma_x^2} + \frac{(y - y_g)^2}{2\sigma_y^2}\right]} \quad (23)$$

where $I(x, y)$ is the gray level of pixel $(x, y)$ in the glint region, $H$ (the amplitude of the Gaussian distribution) is the highest gray level in the glint region, $(x_g, y_g)$ is the glint center to be calculated, and $\sigma_x$ and $\sigma_y$ are the standard deviations of the gray level in the horizontal and vertical directions, respectively. Taking the logarithm of Equation (23) and rearranging gives

$$z = a x^2 + b y^2 + c x + d y + e \quad (24)$$

where $z = \ln I(x, y)$, $a = -1/2\sigma_x^2$, $b = -1/2\sigma_y^2$, $c = x_g/\sigma_x^2$, $d = y_g/\sigma_y^2$ and $e = -x_g^2/2\sigma_x^2 - y_g^2/2\sigma_y^2 + \ln H$. Subpixel-precise boundary points of the glint are extracted with cubic spline interpolation near the glint contour, and the pixels inside the glint contour are substituted into Equation (24). The improved total least squares proposed in Section 2.2.4 is used to solve the overdetermined equations composed of Equation (24). The glint center is then calculated from the solved values of $a$, $b$, $c$, $d$ as

$$x_g = -\frac{c}{2a}, \qquad y_g = -\frac{d}{2b} \quad (25)$$
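A minimal sketch of this linearized fit follows; it solves the overdetermined system of Equation (24) with ordinary least squares where the paper applies the improved TLS of Section 2.2.4, and the masked-patch input and ROI offsets are our assumptions.

```python
import numpy as np

def glint_center(patch, x0=0.0, y0=0.0):
    """Fit z = ln I(x, y) as a quadratic (Eq. (24)) and recover the center.

    `patch` is assumed to be an ROI in which pixels outside the glint
    contour are zero; (x0, y0) is the ROI offset in the full image.
    """
    ys, xs = np.nonzero(patch)                    # pixels inside the contour
    z = np.log(patch[ys, xs].astype(np.float64))
    M = np.column_stack([xs ** 2, ys ** 2, xs, ys, np.ones_like(xs)])
    a, b, c, d, e = np.linalg.lstsq(M, z, rcond=None)[0]
    # x_g = -c / (2a), y_g = -d / (2b)            (Eq. (25))
    return x0 - c / (2 * a), y0 - d / (2 * b)
```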
Figure 10 shows the detection result of glint center (marked with green “+”).
Figure 10. Detection result of glint center.

3. Experimental Results

3.1. Pupil Detection

3.1.1. Pupil Detection of Single Subject

The process of pupil detection is shown in Figure 11. Figure 11a–d shows four original eye images with different relative positions of pupil and glints, acquired from a single subject; (a1)–(d1) show the binary eye images obtained with the improved Otsu optimal threshold; (a2)–(d2) show the results of the opening-and-closing operation with a 5 × 5 square structure element; (a3)–(d3) show the extraction results of pupil boundary points (marked with yellow "+"); (a4)–(d4) show the pupil fitting results (red ellipses), with the red marker indicating the center of each fitted pupil contour.
Table 2 shows the parameters of pupil detection, including the thresholds $T_1$ and $T_2$, the rough pupil center and the final fitted pupil center.
Figure 11. (ad) Original eye image; (a1d1) Eye binary image utilizing improved Otsu optimal threshold; (a2d2) Results of opening-and-closing operation; (a3d3) Extraction result of pupil boundary points; (a4d4) Pupil fitting result.
Table 2. Parameters of pupil detection.

Eye Image     Threshold T1    Threshold T2    Rough Pupil Center (x0, y0)    Final Pupil Center (xp, yp)
Figure 11a    13              117             (199, 230)                     (196.39, 230.88)
Figure 11b    15              120             (311, 212)                     (310.66, 209.67)
Figure 11c    15              122             (344, 194)                     (344.34, 192.55)
Figure 11d    14              119             (379, 207)                     (378.43, 206.37)

3.1.2. Pupil Detection of Different Subjects

To verify the applicability of the proposed circular ring rays location (CRRL) method, original eye images of another four subjects were acquired; the experimental results are shown in Figure 12. As described in Section 2.2.1, a larger structure element is used in the closing operation than in the opening operation. For subjects with heavy eyelashes and eyelids, these different structure-element sizes ensure the complete elimination of residual interference caused by eyelashes and eyelids.
Figure 12. (ad) Original eye image; (a1d1) Eye binary image utilizing improved Otsu optimal threshold; (a2d2) Results of opening-and-closing operation; (a3d3) Extraction result of pupil boundary points; (a4d4) Pupil fitting result.
Table 3 shows the parameters of pupil detection, including the thresholds $T_1$ and $T_2$, the rough pupil center and the final fitted pupil center.
Table 3. Parameters of pupil detection.

Eye Image     Threshold T1    Threshold T2    Rough Pupil Center (x0, y0)    Final Pupil Center (xp, yp)
Figure 12a    22              132             (282, 210)                     (284.12, 211.65)
Figure 12b    13              116             (318, 186)                     (317.46, 185.34)
Figure 12c    24              133             (292, 186)                     (293.59, 185.60)
Figure 12d    11              121             (299, 164)                     (299.38, 162.11)

3.2. Glint Detection

Glint detection is implemented for Figure 11a–d and Figure 12a–d; the detection process is shown in Figure 13. Figure 13(a5)–(d5) and (a7)–(d7) show the rough locations of the glints after binarization and filtering, with the glints numbered 1–4. Figure 13(a6)–(d6) and (a8)–(d8) show the glint detection results; each glint center is marked with a green "+".
Figure 13. (a5d5) Rough location of glints in Figure 11a–d; (a6d6) Glint detection results in Figure 11a–d; (a7d7) Rough location of glints in Figure 12a–d; (a8d8) Glint detection results in Figure 12a–d.
Table 4 shows the parameters of glint detection in Figure 11a–d and Figure 12a–d.
Table 4. Parameters of glint detection.

              Detected Glint Center (xg, yg)
Eye Image     Glint 1             Glint 2             Glint 3             Glint 4
Figure 11a    (212.39, 214.42)    (268.28, 214.64)    (213.53, 241.31)    (266.76, 241.24)
Figure 11b    (293.85, 201.79)    (345.34, 202.49)    (294.15, 227.63)    (343.94, 227.71)
Figure 11c    (296.90, 191.21)    (348.58, 191.36)    (298.34, 217.17)    (347.55, 217.43)
Figure 11d    (314.53, 196.17)    (366.49, 197.12)    (316.03, 221.18)    (365.87, 222.52)
Figure 12a    (264.25, 207.31)    (318.64, 208.15)    (265.12, 235.43)    (317.20, 235.98)
Figure 12b    (211.39, 133.26)    (252.13, 134.68)    (221.64, 149.52)    (251.54, 149.22)
Figure 12c    (265.47, 186.29)    (321.40, 186.24)    (263.68, 216.44)    (319.87, 215.31)
Figure 12d    (284.31, 152.37)    (331.82, 152.21)    (283.14, 176.33)    (329.13, 176.45)

3.3. Stability and Error

To evaluate the stability and accuracy of the proposed method, 105 eye images of each subject were acquired for pupil and glint detection. The stability, RMS error and processing time of the proposed method are shown in Table 5, together with those of the detection methods in [13,20,21,22] as references. As can be seen from the experimental results in Table 5, the stability, accuracy and real-time quality of the proposed method are better than those of the methods in [13,20,21,22].
Table 5. Stability, RMS error and processing time of different methods.

                   Pupil Detection                           Glint Detection
Method             Stability   Error (pixels)   Time (ms)    Stability   Error (pixels)   Time (ms)
Proposed method    99.4%       2.17             43.6         98.7%       0.69             21.5
Paper [13]         94.9%       6.48             92.1         90.9%       1.73             38.6
Paper [20]         95.2%       7.86             65.5         94.1%       1.28             34.1
Paper [21]         97.9%       5.43             54.3         -           -                -
Paper [22]         96.6%       5.95             126.4        -           -                -

4. Conclusions

A novel and robust method of pupil and glint detection using a wearable camera sensor and near-infrared LED array for a gaze tracking system is proposed in this paper. A circular ring rays location (CRRL) method is proposed for the detection of pupil boundary points, and an improved Otsu method is proposed for threshold segmentation. The experimental results show that the segmentation time of the improved method is less than that of the original Otsu method, which contributes to the real-time quality of eye gaze tracking. The gradient-amplitude threshold and the count of matching points per ray are employed to eliminate interference factors. To compensate for the errors of pupil boundary points in the horizontal and vertical directions, an improved total least squares is developed to fit the ellipse; the experimental results show that it achieves higher accuracy than the original total least squares in pupil ellipse fitting. For higher glint location accuracy, the improved total least squares is also used to solve the Gaussian function deformation for calculating the glint center. As the experimental results show, the stability, accuracy and real-time quality of the proposed method are better than those of existing methods for pupil and glint detection. When interference factors such as glints and natural-light reflections lie near the pupil boundary, the resulting interference points can be eliminated quickly and effectively. The proposed method contributes to enhancing the stability, accuracy and real-time quality of gaze tracking systems.

Acknowledgments

This work is supported by the Program for Changjiang Scholars and Innovative Research Team in University under Grant No. IRT1208 and the Basic Research Fund of Beijing Institute of Technology under Grant No. 20130242015. We would like to thank the editor and all anonymous reviewers for their constructive suggestions.

Author Contributions

All authors made significant contributions to this article. Jianzhong Wang was mainly responsible for the deployment of the system and revision of the paper; Guangyue Zhang was responsible for developing the pupil and glint detection method and writing the paper; Jiadong Shi, the corresponding author, was responsible for performing the experiments and analyzing the data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Blondon, K.; Wipfli, R.; Lovis, C. Use of eye-tracking technology in clinical reasoning: a systematic review. Stud. Health Technol. Inf. 2015, 210, 90–94. [Google Scholar]
  2. Higgins, E.; Leinenger, M.; Rayner, K. Eye movements when viewing advertisements. Front. Psychol. 2014, 210. Available online: http://journal.frontiersin.org/article/10.3389/fpsyg.2014.00210/full (accessed on 30 November 2015). [Google Scholar] [CrossRef] [PubMed]
  3. Spakov, O.; Majaranta, P. Scrollable keyboards for casual eye typing. Psychol. J. 2009, 7, 159–173. [Google Scholar]
  4. Noureddin, B.; Lawrence, P.D.; Man, C.F. A non-contact device for tracking gaze in human computer interface. Comput. Vis. Image Underst. 2005, 98, 52–82. [Google Scholar] [CrossRef]
  5. Biswas, P.; Langdon, P. Multimodal intelligent eye-gaze tracking system. Int. J. Hum. Comput. Interact. 2015, 31, 277–294. [Google Scholar] [CrossRef]
  6. Lim, C.J.; Kim, D. Development of gaze tracking interface for controlling 3D contents. Sens. Actuator A Phys. 2012, 185, 151–159. [Google Scholar] [CrossRef]
  7. Yarbus, A.L. Eye Movements and Vision; Plenum Press: New York, NY, USA, 1967. [Google Scholar]
  8. Dodge, R.; Cline, T.S. The angle velocity of eye movements. Psychol. Rev. 1901, 8, 145–157. [Google Scholar] [CrossRef]
  9. Ditchburn, R.W. Eye movements and Visual Perception; Clarendon Press: Oxford, UK, 1973. [Google Scholar]
  10. Miles, W. The peep-hole method for observing eye movements in reading. J. Gen. Psychol. 1928, 1, 373–374. [Google Scholar] [CrossRef]
  11. Robinson, D.A. A method of measuring eye movements using a scleral search coil in a magnetic field. IEEE Trans. Biomed. Eng. 1963, 10, 137–145. [Google Scholar] [PubMed]
  12. Cornsweet, T.N.; Crane, H.S. Accurate two-dimensional eye tracker using first and fourth Purkinje images. J. Opt. Soc. Am. 1973, 63, 921–928. [Google Scholar] [CrossRef] [PubMed]
  13. Ohno, T.; Mukawa, N.; Yoshikawa, A. Free gaze: a gaze tracking system for everyday gaze interaction. In Proceedings of the Symposium on Eye Tracking Research and Applications Symposium, New Orleans, LA, USA, 25–27 March 2002; pp. 125–132.
  14. Goñi, S.; Echeto, J.; Villanueva, A.; Cabeza, R. Robust algorithm for pupil-glint vector detection in a video-oculography eye tracking system. In Proceedings of the International Conference on Pattern Recognition, Cambridge, UK, 23–26 August 2004; pp. 941–944.
  15. Villanueva, A.; Cabeza, R. A novel gaze estimation system with one calibration point. IEEE Trans. Syst. Man Cybern. 2008, 38, 1123–1138. [Google Scholar] [CrossRef] [PubMed]
  16. Gneo, M.; Schmid, M.; Conforto, S.; D’Alessio, T. A free geometry model-independent neural eye-gaze tracking system. J. NeuroEng. Rehabil. 2002, 82. [Google Scholar] [CrossRef] [PubMed]
  17. Blignaut, P. Mapping the pupil-glint vector to gaze coordinates in a simple video-based eye tracker. J. Eye Mov. Res. 2014, 7, 1–11. [Google Scholar]
  18. Lai, C.C.; Shih, S.W.; Hung, Y.P. Hybrid method for 3-D gaze tracking using glint and contour features. IEEE Trans. Circuits Syst. Video Technol. 2015, 25, 24–37. [Google Scholar]
  19. Ebisawa, Y. Unconstrained pupil detection technique using two light sources and the image difference method. Visual. Intell. Des. Engine Arch. 1995, 15, 79–89. [Google Scholar]
  20. Yoo, D.H.; Chung, M.J.; Ju, D.B.; Choi, I.H. Non-intrusive eye gaze estimation using a projective invariant under head movement. In Proceedings of the IEEE International Conference on Robotics and Automation, Orlando, FL, USA, 15–19 May 2006; pp. 3443–3448.
  21. Gwon, S.Y.; Cho, C.W.; Lee, H.C. Robust eye and pupil detection method for gaze tracking. Int. J. Adv. Robot. Syst. 2013, 10, 1–7. [Google Scholar]
  22. Li, D.H.; Winfield, D.W.; Parkhurst, D.J. Starburst: A hybrid algorithm for video-based eye tracking combining feature-based and model-based approaches. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA, 25–25 June 2005; pp. 79–86.
  23. Krishnamoorthi, R.; Annapoorani, G. A simple boundary extraction technique for irregular pupil localization with orthogonal polynomials. Comput. Vis. Image Underst. 2012, 116, 262–273. [Google Scholar] [CrossRef]
  24. Sliney, D.; Aron-Rosa, D.; DeLori, F.; Fankhauser, F.; Landry, R.; Mainster, M.; Marshall, J.; Rassow, B.; Stuck, B.; Trokel, S.; et al. Adjustment of guidelines for exposure of the eye to optical radiation from ocular instruments: Statement from a task group of the International Commission on Non-Ionizing Radiation Protection. Appl. Opt. 2005, 44, 2162–2176. [Google Scholar] [CrossRef] [PubMed]
  25. Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar]
  26. Truchetet, F.; Nicolier, F.; Laligant, O. Subpixel edge detection for dimensional control by artificial vision. J. Electron. Imaging 2001, 10, 234–239. [Google Scholar] [CrossRef]
  27. Pearson, K. On lines and planes of closest fit to systems of points in space. Philos. Mag. 1901, 2, 559–572. [Google Scholar] [CrossRef]
  28. Golub, G.H.; Van Loan, C.F. An analysis of the total least squares problem. SIAM J. Numer. Anal. 1980, 17, 883–893. [Google Scholar] [CrossRef]
  29. Gander, W.; Golub, G.H.; Strebel, R. Least-squares fitting of circles and ellipses. BIT Numer. Math. 1994, 34, 558–578. [Google Scholar] [CrossRef]
  30. Shortis, M.R.; Clarke, T.A.; Short, T. A comparison of some techniques for the subpixel location of discrete target images. In Photonics for Industrial Applications, Proceedings of the International Society for Optics and Photonics, Boston, MA, USA, 6 October 1994; pp. 239–259.
