An Improved Fuzzy Vector Signature with Reusability

Lim, Ilhwan; Seo, Minhye; Lee, Dong Hoon; Park, Jong Hwan

doi:10.3390/app10207141

Open AccessArticle

An Improved Fuzzy Vector Signature with Reusability

¹

Graduate School of Information Security, Korea University, Seoul 02841, Korea

²

Department of Cyber Security, Duksung Women’s University, Seoul 01369, Korea

³

Department of Computer Science, Sangmyung University, Seoul 03016, Korea

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(20), 7141; https://doi.org/10.3390/app10207141

Submission received: 24 July 2020 / Revised: 25 September 2020 / Accepted: 1 October 2020 / Published: 14 October 2020

(This article belongs to the Special Issue Design and Security Analysis of Cryptosystems)

Download

Browse Figures

Versions Notes

Abstract

:

Fuzzy vector signature (FVS) is a new primitive where a fuzzy (biometric) data w is used to generate a verification key

({VK}_{w})

, and, later, a distinct fuzzy (biometric) data

w^{'}

(as well as a message) is used to generate a signature

(σ_{w^{'}})

. The primary feature of FVS is that the signature

(σ_{w^{'}})

can be verified under the verification key

({VK}_{w})

only if w is close to

w^{'}

in a certain predefined distance. Recently, Seo et al. proposed an FVS scheme that was constructed (loosely) using a subset-based sampling method to reduce the size of helper data. However, their construction fails to provide the reusability property that requires that no adversary gains the information on fuzzy (biometric) data even if multiple verification keys and relevant signatures of a single user, which are all generated with correlated fuzzy (biometric) data, are exposed to the adversary. In this paper, we propose an improved FVS scheme which is proven to be reusable with respect to arbitrary correlated fuzzy (biometric) inputs. Our efficiency improvement is achieved by strictly applying the subset-based sampling method used before to build a fuzzy extractor by Canetti et al. and by slightly modifying the structure of the verification key. Our FVS scheme can still tolerate sub-linear error rates of input sources and also reduce the signing cost of a user by about half of the original FVS scheme. Finally, we present authentication protocols based on fuzzy extractor and FVS scheme and give performance comparison between them in terms of computation and transmission costs.

Keywords:

fuzzy vector signature; reusability; biometric authentication

1. Introduction

Biometric information (e.g., fingerprint, iris, face, vein) has been used for user authentication [1,2,3,4,5] because of its uniqueness and immutability. Due to these properties, such biometric information can be used in place of a user secret key in an authentication system. When using biometric information as a security key, the user is not required to memorize or securely store anything to authenticate, which makes the process much more convenient and user-friendly. However, since biometric information is noisy and non-uniformly distributed, it differs greatly from what is known about cryptographic secret keys. In general, a secret key of an authentication system is largely set to be a uniformly random string of fixed length. Until now, a large body of research has been conducted to bridge this gap and enable biometric information to be used as a secret key in a cryptographic way.

To overcome the problem of noisy secret keys, researchers proposed the fuzzy extractor and fuzzy signature as two types of biometric cryptosystems. Comprised of two algorithms, Gen and Rep, a fuzzy extractor [6] is able to generate a uniformly random string of fixed length (i.e., a secret key) from fuzzy (biometric) data. The generation algorithm Gen takes as input sample f fuzzy (biometric) data w to generate a secret key r together with helper data p. The reproduction algorithm Rep takes as input another sample of fuzzy (biometric) data

w^{'}

close to w and p to reproduce r. If the difference between w and

w^{'}

is less than a minimal threshold value, Rep can generate the same secret key r. On the other hand, consisting of three algorithms, KG, Sign, and Vrfy, the fuzzy signature [7] generates a signature by using fuzzy (biometric) data itself as a signing key. The key generation algorithm KG takes as input sample of fuzzy (biometric) data w to generate a verification key

v k

. The signing algorithm Sign takes as input another sample of biometric information

w^{'}

to generate a signature

σ

. The verification algorithm Vrfy takes as input

v k

and

σ

and succeeds in verification if w and

w^{'}

are close to within a fixed threshold.

Biometric cryptosystems are generally considered secure when each sample of fuzzy (biometric) data is used only once. However, in reality, a user may use the same biometric source (e.g., right index fingerprint) to authenticate their accounts for several applications as a matter of expediency. Since similar biometric information is used multiple times, a new security notion is required to guarantee both the privacy of the fuzzy (biometric) data at hand and the reusable security of biometric cryptosystems in this situation. In 2004, Boyen [8] introduced the reusability of a fuzzy extractor, which ensures no entropy loss to the secret key or biometric source even when relevant pairs of helper data or keys from similar forms of biometric information are revealed. Since then, many researchers have focused on studying this fuzzy extractor with reusability (a.k.a. reusable fuzzy extractor).

In order for a reusable fuzzy extractor to be widely used in practice, it should be able to tolerate more than a certain level of errors inherent in fuzzy (biometric) data. For example, iris readings have the average error rate of 20–30% [9,10,11]. A number of studies have proposed constructions that tolerate linear fraction of errors [8,12,13,14,15], but these schemes impose a strong requirement on the distribution of fuzzy (biometric) data, namely: (1) the distribution must have sufficiently high min-entropy (“Y” in High min-entropy in Table 1), or (2) any difference between two distinct biometric readings cannot significantly decrease the min-entropy of the fuzzy (biometric) data (“

H_{\infty} [w_{i} | w_{i} - w_{j}] > m

” in Source Distribution in Table 1). Unfortunately, both of these expectations are somewhat unrealistic.

Canetti et al. [17] relaxed these conditions, only requiring a subset of samples to have sufficient average min-entropy for given subset indices, and proposed a reusable fuzzy extractor that would tolerate a sub-linear fraction of errors. The construction is contingent on the existence of a powerful tool called digital locker, which is a non-standard assumption. Other reusable biometric cryptosystems [16,19] did not require either unrealistic requirements on the biometric distribution or contain non-standard assumptions, but they only tolerated a logarithmic fraction of errors.

Recently, a new primitive called fuzzy vector signature (FVS) [20] was proposed based on bilinear maps (i.e., pairings), which improved the error tolerance rate without any additional requirements on the distribution of biometric information. This scheme tolerates a sub-linear fraction of errors and also is based on standard assumptions, like the external Diffie-Hellman (XDH). It is also claimed to be reusable, but no formal proof for reusability was provided in Reference [20]. In this paper, we introduce the formal security model for reusability of fuzzy vector signature (FVS) and prove that our proposed scheme is reusable in the reusability model. By more strictly applying the subset-based sampling method [17], our scheme is also more efficient than Reference [20] from the perspective of the user and the authentication server. Specifically, it reduces not only the size of the signature and verification key but also the number of pairing operations required for verification. Section 5 outlines a detailed performance comparison.

1.1. Related Work

Reusable Fuzzy Extractor. The concept of a fuzzy extractor and secure sketch was first proposed by Dodis et al. [6]. Following this, Boyen et al. [8] introduced the notion of reusability, meaning that additional keys could be securely generated for any helper data, which is required to regenerate the key, even if the helper data or key pairs were exposed. Wen et al. [14] subsequently proposed a reusable fuzzy extractor based on the Decisional Diffie-Hellman (DDH) problem, which can tolerate a linear size of errors. However, their scheme requires that, for any two distinct readings of the same source, the differences between them should not leak significant information about the source—a requirement that is too strict, as each component of fuzzy (biometric) data is non-uniformly distributed. In response to this, Wen et al. [15] proposed a DDH-based reusable fuzzy extractor to remove the requirement [14] by changing the source distribution. Apart from these studies, Wen et al. [13] also proposed a reusable fuzzy extractor that allowed a linear fraction of errors based on the Learning with Errors (LWE) assumption [22]. However, their schemes [13,14,15] used a secure sketch as a method for controlling noise of data. A secure sketch is used for recovering w from

w^{'}

if w and

w^{'}

are close to within a fixed distance, but it nevertheless causes a small leakage of biometric information. Therefore, when considering reusability scenarios, it is noted that these schemes require an input source to have sufficiently high min-entropy to avoid brute-force attack even after security loss.

On the other side of the spectrum are reusable fuzzy extractors that did not suggest using secure sketches. Canetti et al. [17] proposed a reusable fuzzy extractor that is constructed using the subset-based sampling technique instead of a secure sketch to control a noisy data source. This scheme relied on the use of a digital locker to generate helper data. However, even if a digital locker was simply instantiated with a hash function, the size of helper data would increase significantly (e.g., about 0.8 GB to achieve an error tolerance rate of

20 %

). Cheon et al. [18] modified this scheme [17] to reduce the size of the helper data, but in turn, the computational cost for Rep increased significantly. Alamélou et al. [12] also improved [17] and suggested a fuzzy extractor that achieved a linear fraction of errors. However, they also used a digital locker and this scheme had an unrealistic requirement that each component of a fuzzy (biometric) source had to have significant min-entropy. Apon et al. [16] modified a non-reusable fuzzy extractor [23] based on the LWE assumption into a reusable one. However, their scheme can tolerate only a logarithmic fraction of errors due to the time-consuming process of reproducing a key even with a small number of errors.

Fuzzy Signature. The concept of a fuzzy signature was proposed by Takahashi et al. [7]. The fuzzy signature does not need helper data, unlike the fuzzy extractor, because it only requires a valid correlation between two input fuzzy (biometric) data relevant to a verification key and a signature, which can be achieved using a linear sketch. However, in Reference [7], a strong assumption was required that input fuzzy (biometric) data should be uniformly distributed, which is then relaxed in Reference [24]. Afterwards, Yasuda et al. announced that a linear sketch of fuzzy extractors [7,24] is vulnerable to recovery attacks [25]. In Reference [26], the merge of Reference [7,24], Takahashi et al. proposed two instantiations of a fuzzy signature secure against recovery attacks. In 2019, Tian et al. first introduced the notion of reusability in a fuzzy signature [19]. To construct a reusable fuzzy signature, they adopted a reusable fuzzy extractor in Reference [16]. Consequently, the reusability is limited only to the generation of verification keys, ignoring the privacy of signatures.

Fuzzy Vector Signature. Seo et al. first proposed a fuzzy vector signature [20] following the subset-based sampling method of Reference [17]. The fuzzy vector signature requires the signing parameter like helper data in fuzzy extractor, to be reusable, but the size of the signing parameter is much smaller than that of helper data in Reference [17]. In addition, a fuzzy vector signature can tolerate sub-linear errors, while a reusable fuzzy signature [19] can only tolerate logarithmic errors. However, the size of verification key in Reference [20] is still huge, which results in high computational costs for verification. In addition, security models for reusability are incomplete as in Reference [19], and, as a result, no formal proof of reusability is provided.

1.2. Source Distributions

A source distribution of reusable biometric cryptosystems can be categorized into four types, according to correlation between two repeating readings, as in Table 1.

“

w_{i} = w + δ_{i}

” implies that the hamming distance between

w \in W

and

w + δ_{i}

is less than a threshold value t, where W is an input source. Since

δ_{i}

is randomly chosen by an adversary,

w + δ_{i}

may not be included in W, which is a little far from reality. Especially in Reference [16,19], additional assumptions were made that W should be a small error distribution

χ

of Learning with Errors (LWE) problem and both w and

w + δ_{i}

should be in W to tolerate logarithmic errors.

“

H_{\infty} [w_{i} | w_{i} - w_{j}] > m

” implies that, for any two distinct readings

w_{i}

and

w_{j}

in W,

H_{\infty} [w_{i} | w_{i} - w_{j}] > m

holds where m is a minimum level of security. In other words, the difference between

w_{i}

and

w_{j}

(i.e.,

w_{i} - w_{j}

) should not leak too much information of

w_{i}

even if

w_{i} - w_{j}

is correlated to

w_{i}

, which is a strong requirement for the input source.

“

(w, w_{i})

” implies that any two distinct readings are arbitrarily correlated, which would be the most realistic assumption. However, as a trade-off, schemes based on this distribution require additional assumptions on the input source. For example, in Reference [17,18], any subset of

W = (W [1],

\dots,

W [n])

should have high min-entropy even if indices are exposed, and, in Reference [12], each component

W [j]

should have a high min-entropy even if other components are exposed.

Unlike the above three types, a previous fuzzy vector signature [20] used a

(T, k)

-block source in Reference [21], although following the subset concept in Reference [17] for construction.

(T, k)

-block source means that for input sources

W_{1}, \dots, W_{T}

, i-th source has a high min-entropy even though

i - 1

readings are set to

w_{1}, \dots, w_{i - 1}

, i.e.,

H_{\infty} [W_{i} | W_{1} = w_{1}, \dots, W_{i - 1} = w_{i - 1}] > k

for all

i \in [1, T]

where k is a minimum level of security. Namely,

W_{1}, \dots, W_{T}

are not correlated. However, since fuzzy (biometric) data from the same source must be correlated, it is inappropriate to use the

(T, k)

-block source when considering reusability.

1.3. Contribution

In this paper, we propose a new fuzzy vector signature (FVS) scheme based on a subset concept in Reference [17] to deal with noise in fuzzy (biometric) data. In consequence, our scheme is reusable and can tolerate sub-linear errors without any additional requirements, such as sufficiently high min-entropy of the input source. Compared to the previous fuzzy vector signature [20], we eliminated redundant parts in both verification key and signature by trying a new approach to security proofs, which in turn improved the efficiency of the scheme. For instance, for 80-bit security with

20 %

error tolerance rates, we reduce the size of the signing parameter by

33 %

, from 48 KB to 32 KB, the signature by

50 %

, from 32 KB to 16 KB, and a verification key by

22 %

, from

1.61

GB to

1.25

GB, where the length of the fuzzy (biometric) data is

n = 512

bits. Additionally, we reduce the number of pairing operations in verification by up to

33 %

from

18.5 \times 10^{6}

to

12.3 \times 10^{6}

.

We also provide the formal security models to reflect reusability of the privacy of the verifier and signer (i.e., VK-privacy and SIG-privacy, respectively) and the unforgeability of the signature (i.e., Reusability). And we prove these properties on the assumption that the repeating readings of fuzzy (biometric) data are arbitrarily correlated, which is more realistic than the

(T, k)

-block source used in Reference [20]. In addition, we analyze the performance of our FVS scheme in terms of transmission and computational costs by comparing it with previous reusable biometric cryptosystems [17,18,20]. In particular, a signature can be generated in 155 ms on the signer’s side, which is almost twice as fast as in Reference [20] (for more details, see Section 5).

2. Preliminaries

2.1. Notation

Let

λ

be the security parameter and

poly (λ)

denote a polynomial in variable

λ

. We use the acronym “PPT“ to indicate probabilistic polynomial-time. Let

W

be a fuzzy (biometric) space with metric space

M

and

Z

be a set of arbitrary alphabet. We denote

w \leftarrow W

as the process of sampling w according to a random variable W such that

W \in W

. Let

w = {w [1], \dots, w [n]}

be a fuzzy vector of length n. We denote string concatenation with the symbol “

| |

,” and U represents an arbitrary random distribution.

2.2. Hamming Distance Metric

A metric space is a set

M

with a non-negative integer distance function dis:

M^{2} \to Z \cup {0}

. The elements of

M

are vectors in

Z^{n}

for some alphabet sets

Z

. For any

w, w^{'} \in M

, the hamming distance dis

(w, w^{'}) = | {i | w_{i} \neq {w^{'}}_{i}} |

is defined as the number of components in which w and

w^{'}

differ.

2.3. Min-Entropy

Let X and Y be random variables. The min-entropy of X is defined as

H_{\infty} [X] = - {log}_{2} ({max}_{x \in X} P [X = x])

. For conditional distributions, the average min-entropy of X given Y is defined by

{\tilde{H}}_{\infty} [X | Y] = - {log}_{2} (E_{y \in Y} [{max}_{x \in X}

P [X = x | Y = y]])

. For a given source

X = (X [1],

\dots,

X [n])

, we say that X is a source with

α

-entropy ℓ-samples if

\begin{matrix} {\tilde{H}}_{\infty} [X [j_{1}], \dots, X [j_{ℓ}] | j_{1}, \dots, j_{ℓ}] \geq α, \end{matrix}

where

α

and ℓ are determined by the security parameter

λ

.

2.4. Statistical Distance

The statistical distance between random variables X and Y with the same domain is defined by

\begin{matrix} Δ (X, Y) = \frac{1}{2} \sum_{x} | Pr [X = x] - Pr [Y = x] | . \end{matrix}

2.5. Universal Hash Function

A collection

H

of hash functions

H : U \to V

is

1 / | V |

-universal if for any

x_{1}, x_{2} \in U

such that

x_{1} \neq x_{2}

, it holds that

{Pr}_{H \leftarrow H} [H (x_{1}) = H (x_{2})] = \frac{1}{| V |}

.

Lemma 1.

(Reference [6]) Let

A, B, C

be random variables. Then,

(a): For any $ξ > 0$ , the conditional entropy $H_{\infty} [A | B = b]$ is at least ${\tilde{H}}_{\infty} [A | B] - log (1 / ξ)$ with a probability of at least $1 - ξ$ over the choice of b.
(b): If B has at most $2^{λ}$ possible values, then ${\tilde{H}}_{\infty} [A | (B, C)]$ $\geq {\tilde{H}}_{\infty} [(A, B) | C] - λ \geq {\tilde{H}}_{\infty} [A | C] - λ$ . In particular, ${\tilde{H}}_{\infty} [A | B] \geq H_{\infty} [(A, B)] - λ \geq H_{\infty} [A] - λ$ .

Lemma 2.

(Reference [27]) Let

(X, Z)

be any two jointly distributed random variables and Z has at most

2^{v}

possible values. Then, for any

ϵ > 0

it holds that

Pr [H_{\infty} [X | Z = z] \geq H_{\infty} [X] - v - log (1 / ϵ)] \geq 1 - ϵ

.

Lemma 3.

Let

H

be a family of universal hash functions

H : U \to V

, and let

(X_{1}, \dots, X_{q})

be a joint distribution such that

H_{\infty} [X_{i}] \geq α

for

i = 1, . . ., q

and

α \geq q \cdot log | V | + 3 log (1 / ϵ) + Θ (1)

. Then, the distribution

(H_{1}, H_{1} (X_{1}), \dots, H_{q}, H_{1} (X_{q}))

, where

(H_{1}, . . ., H_{q}) \leftarrow H^{q}

, is

2 ϵ q

-close to the uniform distribution over

{(H \times V)}^{q}

.

Proof.

Proof of Lemma 3 is in Appendix A. □

2.6. Discrete Logarithm Assumption

Let

G

be a group of prime order q, and let g be a generator of

G

. For any PPT algorithm

A

, we define the advantage of

A

, denoted by

{Adv}_{A}^{DL} (λ)

, in solving the discrete logarithm (DL) problem as follows:

\begin{matrix} {Adv}_{A}^{DL} = Pr [a \leftarrow A (G, q, g, g^{a}); a \leftarrow_{R} Z_{q}] \end{matrix}

We say that the DL assumption holds in

G

if, for all PPT algorithms

A

and any security parameter

λ

,

{Adv}_{A}^{DL} (λ) < ϵ (λ)

for some negligible function

ϵ

.

2.7. Bilinear Maps

Let

G_{1}, G_{2}, G_{T}

be groups of some prime order q and a bilinear map (or pairing)

e : G_{1} \times G_{2} \to G_{T}

over

(G_{1}, G_{2})

be admissible if it satisfies the following properties:

Bilinear: The map is bilinear if $e (g^{a}, h^{b}) = e {(g, h)}^{a b}$ for all $g \in G_{1}, h \in G_{2}$ .
Non-degenerate: $e (g, h) \neq 1$ .
Computable: There is an efficient algorithm to compute $e (g, h)$ for all $g \in G_{1}, h \in G_{2}$ .

Our fuzzy vector signature is constructed using a Type-3 pairing where

G_{1} \neq G_{2}

and there is no known efficiently computable isomorphism between

G_{1}

and

G_{2}

.

2.8. External Diffie-Hellman (XDH) Assumption

Let

G_{1}

,

G_{2}

be groups with prime order q and

g, h

be generators of

G_{1}

,

G_{2}

, respectively. The XDH problem in

G_{1}

is defined as follows: given

D = (g, g^{a}, g^{b}, h) \in G_{1}^{3} \times G_{2}

and

T \in G_{1}

with a Type-3 pairing, the goal of an adversary

A

is to distinguish whether T is either

g^{a b}

or random R. For any PPT algorithm

A

, the advantage in solving the XDH problem in

G_{1}

is defined as

\begin{matrix} {Adv}_{A}^{XDH} (λ) = | Pr [A (D, g^{a b}) \to 1] - Pr [A (D, T) \to 1] | . \end{matrix}

We say that an XDH assumption holds in

G_{1}

if, for any PPT algorithm

A

, the advantage

{Adv}_{A}^{XDH} (λ)

is negligible for

λ

.

3. Definitions

3.1. Syntax of Fuzzy Vector Signature

Let

W

be a fuzzy (biometric) space with the Hamming distance metric

M

and w is a sample of random variable

W \in W

. A fuzzy vector signature (FVS) scheme [20] is defined by three algorithms (Setup, Sign, Verify) as follows:

Setup $(1^{λ}, w, n, d, ℓ, t)$ : The setup algorithm takes as input the security parameter $1^{λ}$ , a sample of fuzzy (biometric) data $w \leftarrow W$ , the length n of w, the number d of subsets, the number ℓ of elements included into each subset, and the maximum number t of errors that can be tolerated. It generates a signing parameter $SP$ and verification key ${VK}_{w}$ corresponding to w. Here, $t / n$ is said to be the error tolerance rate.
Sign $(SP, w^{'}, m)$ : The signature generation algorithm takes as input a signing parameter $SP$ , a sample of fuzzy (biometric) data $w^{'} \leftarrow W$ of length n, and a message m. It generates a signature $σ_{w^{'}}$ corresponding to $w^{'}$ .
Verify $({VK}_{w}, σ_{w^{'}}, m)$ : The verification algorithm takes as input a verification key ${VK}_{w}$ , a signature $σ_{w^{'}}$ , and a message m. If a signature $σ_{w^{'}}$ is valid under the condition that $dis (w, w^{'}) \leq t$ , it outputs 1; otherwise 0.

Correctness. Let

δ

be the probability that the Verify algorithm outputs 0, and let dis

(w, w^{'}) \leq t

for two sample of fuzzy (biometric) data

w, w^{'} \in M

. For the signing parameter

SP

and the verification key

{VK}_{w}

, if

σ_{w^{'}}

←Sign

(SP, w^{'}, m)

, then

Pr [

Verify $({VK}_{w}, σ_{w^{'}}, m)$

= 1]

\geq 1 - δ

.

3.2. Security Models

We considered three security notions for FVS security: VK-privacy, SIG-privacy, and reusability.

3.2.1. VK-Privacy

VK-privacy blocks an adversary from obtaining any information about fuzzy input data w from a verification key

{VK}_{w}

. In other words, the adversary cannot distinguish between a real

{VK}_{w}

and a random R. We say that an FVS scheme is VK-private if the advantage that any PPT adversary

A

wins against the challenger

C

in the following game for any

j = 1, \dots, q

is negligible in

λ

:

Setup: $A$ selects target correlated random variables $W = (W_{1}, \dots, W_{q}) \in W^{q}$ and gives these to $C$ .
Generation: $C$ samples $(w_{1}, \dots, w_{q}) \in (W_{1}, \dots, W_{q})$ . $C$ runs Setup $(1^{λ}, w_{k}, n, d, ℓ, t)$ for $k = 1, \dots, q$ , and chooses random R. $C$ chooses one of the modes, real or random experiment. For any $k \in [1, q]$ , if the mode is real, $C$ gives $({VK}_{w_{1}}, \dots, {VK}_{w_{k}}, \dots, {VK}_{w_{q}}, {SP}_{1}, \dots, {SP}_{q})$ ; otherwise $({VK}_{w_{1}}, \dots, R, \dots, {VK}_{w_{q}}, {SP}_{1}, \dots, {SP}_{q})$ .
Distinguishing: For k, $A$ outputs a bit $b \in {0, 1}$ that, respectively, represents a real or random experiment.

Definition 1 (VK-privacy.).

An FVS scheme

Π_{f}

is VK-private if, for any PPT adversary

A

against VK-privacy, there exists a negligible function

ν (λ)

such that

{Adv}_{Π_{f}, A}^{V K} (λ) ≜

\begin{matrix} | Pr [A ({VK}_{w_{1}}, . . ., {VK}_{w_{k}}, . . ., {VK}_{w_{q}}, {SP}_{1}, . . ., {SP}_{q}) = 1] - \\ Pr [A ({VK}_{w_{1}}, . . ., R, . . ., {VK}_{w_{q}}, {SP}_{1}, . . ., {SP}_{q}) = 1] | \leq ν (λ) . \end{matrix}

3.2.2. SIG-Privacy

SIG-privacy means that an adversary cannot ascertain any information on fuzzy input data

w^{'}

from a signature

σ_{w^{'}}

. The adversary cannot distinguish between a valid signature

σ_{w^{'}}

and a signature corresponding to a uniformly random fuzzy data u without a corresponding verification key. We say that an FVS scheme is SIG-private if the advantage that any PPT adversary

A

wins against the challenger

C

in the following game for

j = 1, \dots, q

is negligible in

λ

:

Setup: $A$ selects target correlated random variables $W = (W^{*}, W_{1}, \dots, W_{q}) \in W$ and gives it to $C$ . $C$ chooses $(w^{*}, w_{1}, \dots, w_{q}) \in (W^{*}, W_{1}, \dots, W_{q})$ . Then, $C$ runs Setup $(1^{λ}, w_{j}, n, d, ℓ, t)$ for $j \in [1, q]$ and Setup $(1^{λ}, w^{*}$ , $n, d, ℓ, t)$ . $C$ sends ${{VK}_{w_{j}},$ ${SP}_{j}}_{j = 1}^{q}$ and ${SP}^{*}$ to $A$ .
Query: $A$ issues a random variable $W^{'}$ correlated with $(W^{*}, W_{1}, \dots, W_{q})$ , a message $m_{k}$ , and an index $j \in [1, q]$ of the signing parameter ${SP}_{j}$ . $C$ chooses $w^{'} \leftarrow W^{'}$ correlated with $(w^{*}, w_{1}, \dots, w_{q})$ , runs Sign $({SP}_{j}, w^{'}, m_{k})$ , and gives the resulting signature to $A$ .
Challenge: $A$ issues a message $m^{*}$ . $C$ obtains $σ^{*} \leftarrow Sign ({SP}^{*},$ $w^{*}, m^{*})$ and selects a bit $b \in {0, 1}$ at random. If $b = 0$ , $C$ sends $σ^{*}$ to $A$ . Otherwise, $C$ selects a random input u from a uniformly random distribution U, obtains $σ^{*} \leftarrow Sign ({SP}^{*},$ $u, m^{*})$ , and gives $σ^{*}$ to $A$ .
Guess: $A$ outputs its guess $b^{'} \in {0, 1}$ . If $b^{'} = b$ , $A$ wins the game.

Definition 2 (SIG-privacy.).

An FVS scheme

Π_{f}

is SIG-private if, for any PPT adversary

A

against SIG-privacy, there exists a negligible function

ν (λ)

such that

\begin{matrix} {Adv}_{Π_{f}, A}^{S I G} (λ) ≜ | Pr [b = b^{'}] - \frac{1}{2} | \leq ν (λ) . \end{matrix}

3.2.3. Reusability

Reusability means that an adversary cannot generate a valid signature without knowing the target input source of data, even if the adversary is given verification keys and signing parameters correlated with the target input data. In addition, the adversary can get valid signatures that are verified with the target verification key or other (correlated) verification keys via signing oracles. We say that an FVS scheme is reusable if the advantage that any PPT adversary

A

wins against the challenger

C

in the following game is negligible in

λ

:

Setup: $A$ selects correlated random variables $(W_{1}, \dots,$ $W_{q})$ $\in W^{q}$ and gives it to $C$ . $C$ chooses $(w_{1}, \dots, w_{q})$ $\in (W_{1}, \dots, W_{q})$ and runs Setup $(1^{λ}, w_{j}, n, d, ℓ, t)$ for $j = 1, \dots, q$ . Then, $C$ gives ${{VK}_{w_{j}}, {SP}_{j}}_{j = 1}^{q}$ to $A$ .
Signing query: $A$ issues a random variable $W^{'}$ correlated with $(W_{1}, \dots, W_{q})$ , a message $m_{k}$ , and an index $j \in [1, q]$ of the signing parameter ${SP}_{j}$ . $C$ chooses $w^{'} \leftarrow W^{'}$ correlated with $(w_{1}, \dots, w_{q})$ , and runs Sign $({SP}_{j}, w^{'}, m_{k})$ . $C$ sends the resulting signature to $A$ .
Output: $A$ outputs $(m^{*}, σ^{*})$ such that $σ^{*}$ was not the output of $m^{*}$ queried. If Verify $({VK}_{w_{j}}, σ^{*}, m^{*}) = 1$ for some $j \in {1, \dots, q}$ , $A$ wins the game.

Definition 3 (Reusability).

An FVS scheme

Π_{f}

is reusable in chosen message attacks if, for any PPT adversary

A

making at most

q_{s}

signature queries, there is a negligible function

ν (λ)

such that

\begin{matrix} {Adv}_{Π_{f}, A}^{R E U} (λ) ≜ Pr [A wins] \leq ν (λ) . \end{matrix}

4. Fuzzy Vector Signature

4.1. Construction

Let

G = (p, g, h, G_{1}, G_{2}, G_{T}, e)

be a bilinear group, where

g \in G_{1}

and

h \in G_{2}

are generators. Let

W

be a (biometric) space with the Hamming distance metric, and let

W \in W

be a random variable that is the user source. Given W, we consider two fuzzy (biometric) samples such that

w, w^{'} \in W

. In this section, we present our FVS scheme that consists of the following three algorithms: Setup, Sign, and Verify.

Setup $(1^{λ}, w, n, d, ℓ, t)$ For a security parameter $λ$ , the setup algorithm generates a bilinear group $G$ , and picks a hash function $H : {0, 1}^{*} \to Z_{p}$ . The setup algorithm generates a signing parameter ( $SP$ ) as follows:
- Pick random elements $x_{i}, y_{i} \in Z_{p}$ for $i \in [1, n]$ and set $X_{i} = g^{x_{i}}, Y_{i} = g^{y_{i}} \in G_{1}$ for $i \in [1, n]$ .
- Generate a random element $g_{1} \in G_{1}$ .
- Output $SP = (g, g_{1}, H, {X_{i}, Y_{i}}_{i \in [1, n]})$ .
Let n be the length of a fuzzy (biometric) data w, and d be the number of entire subsets, ℓ be the number of elements included in each subset, and t be the maximum number of errors among n elements, indicating an error tolerance rate. Given a fuzzy (biometric) data $w = (w [1], \dots, w [n]) \leftarrow W$ , the setup algorithm generates a verification key ${VK}_{w}$ as follows:
- Randomly select a set $I_{j} \subset {1, \dots, n}$ where $| I_{j} | = ℓ$ for $j \in [1, d]$ .
- Set a subset $\vec{v_{j}} = {w [i] | i \in I_{j}}$ of a vector w for $j \in [1, d]$ .
- Select random elements $r_{j} \in Z_{p}$ for $j \in [1, d]$ .
- Set $v k_{j} = ({(\prod_{i \in I_{j}} h^{x_{i} + w [i] y_{i}})}^{r_{j}},$ $h^{r_{j}}) \in G_{2}^{2}$ for $j \in [1, d]$ .
The verification key ${VK}_{w}$ is given by
- ${VK}_{w} = ({v k_{1}, \dots, v k_{d}}, {I_{1}, \dots, I_{d}})$ .
Sign $(SP, w^{'}, m)$ . To sign a message m under fuzzy (biometric) data $w^{'} = (w^{'} [1], \dots,$ $w^{'} [n]) \leftarrow W$ , the signing algorithm generates a signature $σ_{w^{'}}$ as follows:
- Choose random elements $s, k \in Z_{p}^{*}$ .
- Set $σ_{(1, i)} = {(X_{i} Y_{i}^{w^{'} [i]})}^{s}$ for $i \in [1, n]$ .
- Set $σ_{2} = g^{s} \in G_{1}$ .
- Set $σ_{3} = g_{1}^{s} \in G_{1}$ .
- Set $σ_{4} = H (g^{k}, g_{1}^{k}, σ_{2}, σ_{3}, {σ_{(1, i)}}_{i \in [1, n]}, m)$
- Set $σ_{5} = k + σ_{4} \cdot s \in Z_{p}$ .
- Output $σ_{w^{'}} = ({σ_{(1, i)}}_{i \in [1, n]},$ $σ_{2}, σ_{3}, σ_{4}, σ_{5})$ .
Verify $({VK}_{w}, σ_{w^{'}}, m)$ . To verify a signature $σ_{w^{'}} = ({σ_{(1, i)}}_{i \in [1, n]},$ $σ_{2}$ , $σ_{3}$ , $σ_{4}$ , $σ_{5})$ on a message m under the verification key ${VK}_{w} = ({v k_{1}$ , …, $v k_{d}}$ , ${I_{1}$ , …, $I_{d}})$ , the verification algorithm proceeds as follows:
- Set $v k_{j} = (v k_{1, j}, v k_{2, j})$ for $j \in [1, d]$ .
- Set $c n t = 0$ .
- While $c n t = 0$ for $j = 1, \dots, d$ :
  (1)
  Compute $A = \prod_{i \in I_{j}} σ_{(1, i)}$ .
  (2)
  If $e (σ_{2}, v k_{1, j}) = e (A, v k_{2, j})$ , $c n t = 1$ .
  If $c n t = 0$ , output 0.
- Otherwise, set $B = g^{σ_{5}} σ_{2}^{- σ_{4}}$ and $B_{1} = g_{1}^{σ_{5}} σ_{3}^{- σ_{4}}$ .
- If $σ_{4} = H (B, B_{1}, σ_{2}, σ_{3}, {σ_{(1, i)}}_{i \in [1, n]}, m)$ , output 1; otherwise 0.

4.2. Setting the Number of Subsets

Let

δ

be the probability that the Verify algorithm outputs 0, meaning that it fails to verify a signature

σ_{w^{'}}

using

{VK}_{w}

. Thus, if

δ = 1 / 2

is set, then a signer would produce a signature two times with overwhelming probability of generating a valid signature. Following [17], we show that the number d of subsets is determined by setting the value

δ

. Given

(n, ℓ)

, we assume that our FVS scheme wants to tolerate at most t errors among n elements. During verification, the probability that

e (σ_{2}, v k_{1, j}) = e (A, v k_{2, j})

is at least

{(1 - \frac{t}{n})}^{ℓ}

for each

j = 1, \dots, d

. Thus, the probability that the Verify algorithm outputs 0 is at most

{(1 - {(1 - \frac{t}{n})}^{ℓ})}^{d}

, meaning that no (biometric) vector

{\vec{v}}_{j}

matches the vector corresponding with

w^{'}

. If we set the failure probability

δ

as

\begin{matrix} {(1 - {(1 - \frac{t}{n})}^{ℓ})}^{d} \approx δ, \end{matrix}

the approximation

e^{x} \approx x + 1

gives the relations, such as

{(1 - {(1 - \frac{t}{n})}^{ℓ})}^{d} \approx {(1 - e^{- \frac{t ℓ}{n}})}^{d} \approx exp (- d e^{- \frac{t ℓ}{n}})

. Consequently, we obtain the relation

\begin{matrix} d \approx e^{\frac{t ℓ}{n}} \cdot ln \frac{1}{δ}, \end{matrix}

as required to determine the number d of subsets.

4.3. Correctness

Assume that dis

(w, w^{'})

= γ \leq t

. Then, the probability that

e (σ_{2}, v k_{1, j}) = e (A, v k_{2, j})

for any

j \in [1, d]

is at least

{(1 - \frac{γ}{n})}^{ℓ}

. In this case, since

{(1 - \frac{γ}{n})}^{ℓ}

\geq {(1 - \frac{t}{n})}^{ℓ}

because of

γ \leq t

, we can derive the probability that the Verify algorithm outputs 0 as

\begin{matrix} {(1 - {(1 - \frac{γ}{n})}^{ℓ})}^{d} \leq {(1 - {(1 - \frac{t}{n})}^{ℓ})}^{d} \approx δ, \end{matrix}

by the above approximation. As a result, if dis

(w, w^{'})

= γ \leq t

,

Pr [

Verify $({VK}_{w}, σ_{w^{'}}, m)$

= 1]

\geq 1 - δ

.

4.4. Security

Theorem 1.

If

W

is a family of sources over

Z^{n}

with α-entropy ℓ-samples, then the FVS scheme is VK-private for such a

W

.

Proof.

Before proving the VK-privacy, we first prove that a family

H

of functions

{f_{SP} : Z^{ℓ} \to G_{2}}

generating the partial verification key is a

1 / p

-universal hash function for a fixed

SP

. In our FVS scheme, given a vector

\vec{v_{j}} = {w [i] | i \in I_{j}}

as input, the function

f_{SP} (\vec{v_{j}})

is defined as follows:

\begin{matrix} f_{SP} (\vec{v_{j}}) = {(\prod_{i \in I_{j}} h^{x_{i} + w [i] y_{i}})}^{r_{j}}, \end{matrix}

where

r_{j}

is a randomly chosen exponent

\in Z_{p}

. Since

r_{j}

is raised for each input, it is easy to see that two outputs of the function could be equal with probability

1 / p

. Thus, it holds that for two distinct inputs

\vec{v_{i}}

and

\vec{v_{j}}

,

Pr [f_{SP} (\vec{v_{i}}) = f_{SP} (\vec{v_{j}})] = \frac{1}{p} = \frac{1}{| G_{2} |}

; thus,

H = {f_{SP}}

is

1 / p

-universal.

Next, we prove that a verification key

{VK}_{w}

corresponding with

w \in W

is almost close to uniform distribution based on Lemma 3. After that, we extend the result to the case of a polynomial number of verification keys. Let

W = (W [1],

. . ., W [n])

be a random variable of a source with

α

-entropy ℓ-samples. If

V_{j} = {W [i] | i \in I_{j}}

, we see that

(V_{1}, . . ., V_{d})

is a joint distribution of d subsets of

W \in W

such that

{\tilde{H}}_{\infty} [V_{j} | I_{j}] \geq α

for random sets of indices

(I_{1}, . . ., I_{d})

. In addition, as mentioned above, each function

f_{SP}

is a

1 / p

-universal hash function with respect to a distinct

j \in [1, d]

.

By Lemma 2(a), if a random set of indices

I_{j}

is set to be

I_{i}

, then

H_{\infty} [V_{j} | I_{j} = I_{j}] \geq {\tilde{H}}_{\infty} [V_{j} | I_{j}] - log (1 / ξ)

with a probability of at least

1 - ξ

. Let

f_{SP} (V_{j})

be a random variable for a set

V_{j}

of indices for

j \in [1, d]

and let

(y_{j + 1}, \dots, y_{d})

be a sample of

(f_{SP} (V_{j + 1}), \dots, f_{SP} (V_{d}))

. By a similar proof A1 (proof of Lemma 3), we obtain the following result:

\begin{matrix} {\tilde{H}}_{\infty} [V_{j} | I_{j}, f_{SP}, \dots, f_{SP}, {f_{SP} (V_{k}) = y_{k}}_{k = j + 1}^{d}] \\ \geq {\tilde{H}}_{\infty} [V_{j} | I_{j}, f_{SP}, \dots, f_{SP}] - (d - j) log | G_{2} | - log (1 / ϵ) \\ = {\tilde{H}}_{\infty} [V_{j} | I_{j}] - (d - j) log | G_{2} | - log (1 / ϵ) \\ \geq α - (d - j) log | G_{2} | - log (1 / ϵ), \end{matrix}

where

ϵ

shows negligible probability determined by a security parameter. As a result, if

α \geq d \cdot log | G_{2} | + 3 log (1 / ϵ) + log (1 / ξ) + Θ (1)

, we have

\begin{matrix} H_{\infty} [V_{j} | I_{j} = I_{j}] \\ \geq α - (d - j) log | G_{2} | - log (1 / ϵ) - log (1 / ξ) \\ = j log | G_{2} | + 2 log (1 / ϵ) + Θ (1) \\ \geq log | G_{2} | + 2 log (1 / ϵ) + Θ (1) . \end{matrix}

For

j \in [1, d]

, we can show that

H_{\infty} [V_{j} | I_{j} = I_{j}] \geq log | G_{2} | + 2 log (1 / ϵ) + Θ (1)

, similarly as in the above process.

Let

E X P . r e a l = (f_{SP} (\vec{v_{1}}), \dots, f_{SP} (\vec{v_{d}}))

for some samples (

V_{1} = \vec{v_{1}}, \dots, V_{d} = \vec{v_{d}}

), and let

E X P . r a n d

=

(u_{1}, \dots, u_{d})

be random samples chosen from

G_{2}^{d}

. Because of the above inequality, under the assumption that

α \geq d \cdot log | G_{2} | + 3 log (1 / ϵ) + log (1 / ξ) + Θ (1)

, we see via Lemma 2.3 that, for any (

V_{1}, \dots, V_{d}

) and

(f_{SP}, \dots, f_{SP})

, the statistical distance between

E X P . r e a l

and

E X P .

r a n d

is less than

2 ϵ d

.

We can extend the above result of a single verification key to correlated random variables

W = (W_{1}, \dots, W_{q})

. Let

(V_{i 1}, \dots, V_{i d})

be a joint distribution of d subsets

(I_{i 1}, \dots, I_{i d})

for

i \in [1, q]

. If

α \geq d \cdot q \cdot log | G_{2} | + 3 log (1 / ϵ) + log (1 / ξ) + Θ (1)

, the statistical distance between

E X P . r e a l = ({VK}_{w_{1}}

, …,

{VK}_{w_{k}}

, …,

{VK}_{w_{q}})

and

E X P . r a n d = ({VK}_{w_{1}}, \dots, R_{k},

\dots,

{VK}_{w_{q}})

is less than

2 ϵ (q \cdot d - (q \cdot d - d + 1) + 1) = 2 ϵ d

for any

k \in [1, q]

where

{VK}_{w_{i}} = (f_{{SP}_{i}} (\vec{v_{i 1}}), \dots, f_{{SP}_{i}} (\vec{v_{i d}}))

and

R_{k} = (u_{1}, \dots, u_{d})

for uniformly random

u_{i} \in G_{2}^{d}

. □

Theorem 2.

If the FVS scheme is VK-private and the XDH assumption holds in

G_{1}

, the FVS scheme is SIG-private in the random oracle model.

In reality, if verification keys and signatures were to become revealed to an adversary, we would have to prove that it is infeasible for the adversary to glean any information on the fuzzy (biometric) data from the signatures. As mentioned above, however, we prove from Theorem 4.1 that the VK-privacy holds, meaning that it is difficult for the adversary to get some information on the fuzzy data from verification keys. In addition, the setup algorithm chooses a new signing parameter

SP

each time a verification key is generated for each fuzzy data. Thus, it is sufficient to show that a signature generated under a fixed

SP

does not reveal any information about a challenged fuzzy data.

To do this, a simulator chooses a challenge sample

w^{*}

along with other correlated samples. For the length n of

w^{*}

, we create the following sequence of games where

w^{*}

is used for generating a challenge signature:

\begin{matrix} Game 0 : w^{*} = (w^{*} [1], w^{*} [2], \dots, w^{*} [n]) \\ Game 1 : w^{*} = (R_{1}, w^{*} [2], \dots, w^{*} [n]) \\ ⋮ \\ Game α : w^{*} = (R_{1}, \dots, R_{α}, w^{*} [α + 1], \dots, w^{*} [n]) \\ ⋮ \\ Game n : w^{*} = (R_{1}, \dots, R_{n}) . \end{matrix}

In Game 0, the challenge signature is generated for the original

w^{*}

as in a real game, whereas in Game n the challenge signature is generated for a random vector and thus does not have any information on the original

w^{*}

. For proving the SIG-privacy, it is sufficient to show that it is infeasible for the adversary to distinguish between Game

(α - 1)

and Game

α

under the DDH problem.

Lemma 4.

Under the XDH assumption in

G_{1}

, it is infeasible to distinguish between Game

(α

-1) and Game α.

Proof.

Given an XDH instance

(g, h, g^{a}, g^{b}, T) \in G_{1} \times G_{2} \times G_{1}^{3}

, a challenger

C

interacts with an adversary

A

who tries to break the the SIG-privacy of our FVS scheme.

Setup. $C$ chooses samples $(w^{*}, w_{1}, \dots, w_{q}) \leftarrow (W^{*}$ , $W_{1}$ , …, $W_{q})$ , where those $q + 1$ correlated distributions are from $A$ . In particular, for a sample $w^{*} = (w^{*} [1], \dots, w^{*} [n])$ corresponding to $W^{*}$ , $C$ sets $w^{*} [1] = R_{1}, \dots, w^{*} [α - 1] = R_{α - 1}$ for randomly chosen values $R_{1}, \dots, R_{α - 1} \in Z_{p}$ . For each $w_{j}$ , $C$ generates $({VK}_{w_{j}}, {SP}_{j})$ for $j = 1, \dots, q$ as follows:
-
Choose random values $ϕ$ and ${x_{j i}, y_{j i}}_{i = 1}^{n}$ in $Z_{q}$ .
-
Set ${SP}_{j} = {g, g^{ϕ}, g^{x_{j i}}, g^{y_{j i}}}_{i = 1}^{n}$ .
-
Select d sets of random indices $I_{m}$ such that $| I_{m} | = ℓ$ .
-
Set a subset $\vec{v_{j m}} = {w_{j} [i] | i \in I_{m}}$ for $m \in [1, d]$ .
-
Choose ${r_{j m}}_{m = 1}^{d}$ at random in $Z_{p}$ .
-
Set $v k_{j m} = (h^{r_{j m} (\sum_{i \in I_{m}} x_{j i} + y_{j i} w_{j} [i])}, h^{r_{j m}})$ .
For the target $w^{*}$ , $C$ chooses ${x_{i}^{*}, y_{i}^{*}}_{i = 1}^{n}$ in $Z_{p}$ , and sets ${X_{i}^{*} = g^{x_{i}^{*}}, Y_{i}^{*} = g^{y_{i}^{*}}}_{i = 1, i \neq α}^{n}$ and $X_{α}^{*} = g^{a} g^{x_{α}^{*}},$ $Y_{α}^{*} = g^{y_{α}^{*}}$ , under the implicit setting of ${\tilde{x}}_{α}^{*} = a + x_{α}^{*}$ . $C$ then sets ${SP}^{*} = (g, g^{ϕ}, {X_{i}^{*}, Y_{i}^{*}}_{i = 1}^{n}$ ) and gives ${SP}^{*}$ along with $({VK}_{w_{j}}, {SP}_{j})$ for $j = 1, \dots, q$ to $A$ .
Signing queries. $A$ issues a pair of a correlated distribution $W^{'}$ with $(W^{*}$ , $W_{1}$ , …, $W_{q})$ , an index $j \in [1, q]$ of signing parameter ${SP}_{j}$ , and a message m. Then, $C$ chooses a sample $w^{'} = (w^{'} [1], \dots, w^{'} [n]) \leftarrow W^{'}$ correlated with $(w^{*}, w_{1}, . . ., w_{q})$ . $C$ performs the ordinary signature generation algorithm by taking $({SP}_{j}, w^{'}, m)$ as inputs.
Challenge. $A$ sends a message $m^{*}$ to $C$ . In particular, we use a non-interactive zero-knowledge (NIZK) simulator to generate two elements $(σ_{4}^{*}, σ_{5}^{*})$ without knowing the witness. $C$ generates a challenge signature as follows:
-
Set $σ_{(1, i)}^{*} = {(g^{b})}^{(x_{i}^{*} + y_{i}^{*} \cdot w^{*} [i])}$ for $i \in [1, n]$ and $i \neq α$ .
-
Set $σ_{(1, α)}^{*} = T {(g^{b})}^{(x_{α}^{*} + y_{α}^{*} \cdot w^{*} [α])}$ .
-
Set $σ_{2}^{*} = g^{b}$ .
-
Set $σ_{3}^{*} = {(g^{b})}^{ϕ}$ .
-
Obtain $(σ_{4}^{*}, σ_{5}^{*})$ using the NIZK simulator.
$C$ sends $σ_{w^{*}}^{*} = ({σ_{(1, i)}^{*}}_{i = 1}^{n}, σ_{2}^{*}, σ_{3}^{*}, σ_{4}^{*}, σ_{5}^{*})$ to $A$ .
Guess. $A$ outputs a guess $b \in 0, 1$ in response to the challenge signature.

If

T = g^{a b}

, then

A

interacts with

B

in Game $(α - 1)$ , because

σ_{(1, α)}^{*} = T {(g^{b})}^{(x_{α}^{*} + y_{α}^{*} \cdot w^{*} [α])} = {(g^{a + x_{α}^{*}} {(g^{y_{α}^{*}})}^{w^{*} [α]})}^{b} = {(X_{α}^{*} {(Y_{α}^{*})}^{w^{*} [α]})}^{b}

. Otherwise,

σ_{(1, α)}^{*}

becomes a random element, in which case

A

is in Game $α$ . Therefore, depending on the ability of

A

,

C

is able to solve the given XDH problem. □

Theorem 3.

If the FVS is VK-private and SIG-private and the DL assumption holds in

G_{1}

, the FVS scheme is reusable in random oracle model.

The proof for Theorem 4.3 is almost the same as the proof for unforgeability in Reference [20], but the difference resides in the fact that an adversary in our reusability proof is given verification keys and signatures that correspond to correlated fuzzy (biometric) data. In other words, even if such correlated fuzzy data is reused, our proof shows that it is difficult for the adversary to generate a valid signature with (unknown) target fuzzy data. To prove reusability, we need the VK-privacy and SIG-privacy to guarantee that the verification keys and signatures exposed to the adversary do not reveal any information about the fuzzy data. At this point, there are two strategies we anticipate that the adversary might take in their forgery. The first is that the adversary would guess

w^{'}

from a certain distribution W and then generate a signature on the input

(SP, w^{'}, m)

. However, as long as W is assumed to be a distribution with

α

-entropy ℓ-samples and

α

is sufficiently large with respect to the security parameter, such a strategy can be successful with negligible probability only.

The other strategy is to reuse a previous signature without changing the fuzzy data

w^{'}

embedded into it. More specifically, there are two prongs to this strategy. One is to re-randomize the previous signature by raising a new random exponent

s^{'}

into the elements

{σ_{(1, i}}_{i = 1}^{n}

,

σ_{2}

, and

σ_{3}

. Thus, the discrete logarithm of such elements becomes

s \cdot s^{'}

, where s was chosen by a signer and

s^{'}

is now selected by an adversary. The important point is that the adversary still cannot know the exact discrete logarithm

s \cdot s^{'}

, which is then the witness necessary for generating the other signature elements

(σ_{4}, σ_{5})

as an NIZK proof as it relates to

s \cdot s^{'}

. However, generating such an NIZK proof equates to breaking the statistical soundness of proving the equality of discrete logarithms [28] in regard to the unknown witness. Thus, the probability that the adversary would succeed is at most

q_{h} / p

, where

q_{h}

is the number of H-oracle queries, which is also negligible. Now, the remaining case is to reuse the previous signature as it is and simply reconstruct a new proof

(σ_{4}, σ_{5})

. Fortunately, this case can be reduced to the forgery of a one-time multi-user Schnorr signature [29], which is provably secure under the DL assumption. In our proof, a slight variant of the one-time multi-user Schnorr signature proves the equality of discrete logarithms rather than proving knowledge. In line with this, the variant is also provably unforgeable [30,31] against chosen-message attacks in a multi-user setting (MU-SUF-CMA). Based on the variant we use, we show that, under the DL assumption, it is difficult for the adversary to succeed in the remaining case.

Proof.

A simulator

B

uses an adversary

A

(which breaks the reusability of the FVS scheme) as a subroutine to forge a signature in the one-time multi-user Schnorr signature scheme. Let

q_{s} \leq ρ

for the number

q_{s}

of signature queries. Given

ρ

independent public keys

(g, g_{1}, {(g^{s_{1}}, g_{1}^{s_{1}}), \dots, (g^{s_{ρ}}, g_{1}^{s_{ρ}})})

of the one-time multi-user Schnorr signature scheme,

B

interacts with

A

as follows:

Setup. $A$ gives correlated random variables $(W_{1}, \dots, W_{q})$ $\in W^{q}$ , of which each is in $Z^{n}$ . $B$ samples $(w_{1}, \dots, w_{q}) \leftarrow (W_{1}, \dots, W_{q})$ and runs Setup $(1^{λ}, n, d, ℓ, w_{j}, t)$ to obtain a signing parameter ${SP}_{j}$ and a verification key ${VK}_{w_{j}}$ for $j = 1, \dots, q$ . In the setup algorithm, $(x_{j 1}, \dots, x_{j n})$ and $(y_{j 1}, \dots, y_{j n})$ are selected uniformly at random in $Z_{p}$ . $B$ gives ${({SP}_{j},$ ${VK}_{w_{j}} {)}}_{j = 1}^{q}$ to $A$ .
Signing queries. For $j \in [1, q]$ , $A$ issues signing queries with input, namely a random variable $W^{'}$ correlated with $(W_{1}$ , …, $W_{q})$ , an index of signing parameter ${SP}_{j}$ , and a message $m_{k}$ . $B$ responds to the query as follows:
-
Choose a sample $w^{'} = (w^{'} [1], \dots, w^{'} [n])$ correlated with $(w_{1}, \dots, w_{q})$ from $W^{'}$ .
-
Generate $σ_{1, i} = {(g^{s_{k}})}^{x_{j i} + y_{j i} w^{'} [i]}$ for $i \in [1, n]$ , and set $M_{k} = (g^{s_{k}}, g_{1}^{s_{k}}, {σ_{1, i}}_{i \in [1, n]}, m_{k})$ .
-
Query $(j, M_{k})$ to the signing oracle of the one-time multi-user Schnorr signature scheme, meaning a signing query on a message $M_{k}$ under the j-th public key, and receive $(h_{k}, c_{k})$ .
-
Set $σ_{k} = ({\{σ_{1, i}\}}_{i \in [1, n]}, g^{s_{k}}, g_{1}^{s_{k}}, h_{k}, c_{k})$ and give $σ_{k}$ to $A$ .
Output. $A$ outputs $(m^{*}, σ^{*}) = (m^{*}, ({\{σ_{1, i}^{*}\}}_{i \in [1, n]}$ , $σ_{2}^{*}, σ_{3}^{*}, σ_{4}^{*}, σ_{5}^{*}))$ . $B$ checks if $(m^{*}, σ^{*})$ $\neq (m_{k}, σ_{k})$ for any $k \in [1, q_{s}]$ , and finds $k^{*}$ such that $σ_{2}^{*} = g^{s_{k^{*}}}, σ_{3}^{*} = g_{1}^{s_{k^{*}}}$ . After finding the index j corresponding with the $k^{*}$ -th query, $B$ checks if Verify $({VK}_{w_{j}}$ , $σ^{*}$ , $m^{*})$ outputs 1. If it does, $B$ outputs a forgery of the one-time multi-user Schnorr signature scheme as follows:
-
Set $M^{*} = (σ_{2}^{*}, σ_{3}^{*}, {\{σ_{1, i}^{*}\}}_{i \in [1, n]}, m^{*})$ .
-
Output $(k^{*}, M^{*}, (σ_{4}^{*}, σ_{5}^{*}))$ .

It follows that, as longs as

A

breaks the reusability of the FVS scheme, then

B

can break the MU-SUF-CMA security of the Schnorr signature scheme. Indeed, if Verify

({VK}_{w_{j}}

,

σ^{*}

,

m^{*})

= 1, this means that

(σ_{4}^{*}, σ_{5}^{*})

is the valid signature of the message

(σ_{2}^{*}, σ_{3}^{*}, {σ_{1, i}^{*}}_{i \in [1, n]}, m^{*})

under the

k^{*}

-th public key

(σ_{2}^{*}, σ_{3}^{*})

.

Since it is clearly proven that the variant of the one-time multi-user Schnorr signature scheme is MU-SUF-CMA secure under the DL assumption in the random oracle model [29], the reusability of our FVS scheme can also be proven in the random oracle model under the DL problem. □

5. Performance Analysis

Our FVS scheme is constructed using the subset-based sampling method of the re-usable fuzzy extractor [17], which does not require fuzzy (biometric) input data to have sufficiently high min-entropy and can still tolerate a sub-linear fraction of errors. Generally, biometric data sources are non-uniformly and uniquely distributed from person to person; thus, it is not easy to expect that such biometric data will have high min-entropy. Nevertheless, most reusable fuzzy extractors [8,13,14,15] based on secure sketches require source data with high min-entropy because a secure sketch is known to cause an entropy loss of a biometric input data. For comparison, we focus on fuzzy extractors [17,18] that do not rely on secure sketches, and therefore not require a high min-entropy source. A reusable fuzzy extractor suggested by Reference [12] is not based on a secure sketch and tolerates a linear fraction of errors, but it uses a so-called pseudoentropic isometry that requires each biometric input data component to have high min-entropy. This requirement is also far from realistic biometric information. A reusable fuzzy extractor suggested by Apon et al. [16] is constructed based on the hardness of Learning with Errors (LWE) problems, where biometric data is injected into the part with LWE errors. In that case, such biometric data must follow a certain error distribution (e.g., Gaussian) in order to ensure the security of LWE problems. This reveals a limitation in its real-world application potential. Furthermore, the LWE-based fuzzy extractor [16] requires a time-consuming reproducing algorithm, where another sample of biometric data is subtracted from an LWE instance that was previously created per each component, and then a randomly chosen linear system is expected to be solved. A problem arises if any error components in the chosen linear system are not 0, at which point a new linear system must be randomly reselected until achieving success. The same problem is found in the reusable fuzzy signature [19] that follows the LWE-based fuzzy extractor technique. To mitigate the reproducing problem, [16,19] should be limited to dealing with only a logarithmic size of errors.

Eventually, we compare our FVS scheme with previous fuzzy extractors [17,18] and the original FVS scheme [20], which all follow the subset-based sampling technique [17]. We specifically consider authentication protocols where a fuzzy extractor or an FVS scheme is instantiated to authenticate a user using biometric data. During protocol executions, we compare our proposed scheme with existing schemes in terms of storage or transmission costs and computational costs on the part of the user. In regards to the fuzzy extractor, we assume that a digital signature scheme

S = (KeyGen,

Sign,

Verify)

is additionally provided. As usual, an authentication protocol consists of two phases; enrollment and authentication. In the enrollment phase, a user is registered with an authentication server by sending an identity

I D

, a verification key

v k_{I D}

, and helper data

P_{I D}

. In each authentication phase, a user receives helper data

P_{I D}

from the server, recovers a secret key for signature generation using their biometric data, and returns a signature

σ_{I D}

in response to a challenge message R sent by the server. Figure 1 and Figure 2 present two authentication protocols in more detail.

5.1. Storage or Transmission Costs

In the authentication phase, helper data

P_{I D}

is needed to generate a signing key from user biometric data. There are two options through which the user obtains an

P_{I D}

. The first way is to store it in their own personal device, and the other is to receive it from the server for each instance of authentication. The first method can reduce the amount of transmission, while carrying a personal storage device. Conversely, the second one does not require a personal device, while the server sends out a huge amount of transmission, and it has the advantage in that the user authentication can work on secure devices that are shared by multiple users. For comparison purposes, we present the size of

P_{I D}

as the storage or transmission cost in Table 2.

Let n be the dimension of biometric data, d be the number of subsets, ℓ be the number of elements in each subset, and t be the maximum number of errors among n elements. With fuzzy extractors [17,18], helper data

P_{I D}

consists of an information set, a nonce, and an output of a hash function per subset. To begin, the number d of subsets is obtained via

d \approx ln (1 / δ) \cdot e^{\frac{t ℓ}{n}}

in Reference [17], whereas d is computed as

d \approx m / 2 q

in Reference [18], where

q = \frac{τ (\binom{m}{τ})}{(\binom{n}{t})} \sum_{η = τ}^{m} {(- 1)}^{η - τ} \cdot \frac{(\binom{m - τ}{η - τ}) \times (\binom{n - τ ℓ}{τ})}{t}

for

(τ, m)

-threshold scheme [32]. In this case, the information set per subset has ℓ indices which are represented by

ℓ log n

bits. When using SHA-256 as the hash function, we set the size of a nonce to be sufficient at

176 (= 256 - 80)

bits. As a result, the whole size of

P_{I D}

is about

d \cdot (ℓ log n + 256 + 176)

for

n =

512, 1024, and 2048 cases, which, as shown in Table 2, becomes huge when setting

t / n = 0.20

as an error tolerance rate and

ℓ = 80

. On the other hand, with FVS schemes, the number of subsets is determined by

d \approx ln (1 / δ) \cdot e^{\frac{t ℓ}{n}}

following [17], but helper data

P_{I D}

consists of only a signing parameter

SP

, regardless of the number of subsets. Indeed, in Figure 3,

SP

in Reference [20] is

3 n + 1

elements in

G_{1}

, whereas

SP

in ours is

2 n + 2

elements in

G_{1}

. When taking the Type-3 pairing [33] at the 100-bit security level, the size of elements in

G_{1}

and

Z_{p}

is 256 bits. For

n = 512

, 1024, and 2048 cases, Table 2 shows that the

P_{I D}

size of the FVS schemes are overwhelmingly smaller than that of the fuzzy extractors. Compared to Reference [20], our FVS scheme obtains a slightly shorter size of

SP

with the same parametrization. Regarding the signature size, the

σ_{I D}

in our scheme consists of

n + 2

elements in

G_{1}

plus 2 elements in

Z_{p}

that are transmitted to the server. When taking the Type-3 paring and

n = 512

again, the amount of

σ_{I D}

transmission is about

(512 + 4) \times 256 / 2^{13} = 16

KB.

δ = 1 / 2

is the the probability that verification fails, and we expect the user to run step [A2] of the authentication protocols twice, during which the transmission cost of

σ_{I D}

becomes about 32 KB.

5.2. Computation Cost

For the fuzzy extractor, we considered the computational cost required for obtaining a signing key

K_{I D}

by running the

FE . Rep

algorithm that takes as input helper data

P_{I D}

and a biometric data

b i o^{'}

. This is the step for [A2] in Figure 1. We assume that a reproduced value from

FE . Rep

is straightforwardly used as the singing key corresponding with the verification key

v k_{I D}

. If a hash function H, such as SHA-256, is used as a digital locker [17],

K_{I D}

is locked with

({nonce}_{i}, H ({nonce}_{i}, {b i o}_{i}) ⨁ (K_{I D} | | 0^{s}))

for a positive integer s, where

{b i o}_{i}

is the set of biometric data corresponding to a subset i among n components. Therefore, with a new

b i o^{'}

, the unlocking algorithm needs to perform a hash computation

| H |

plus an XOR operation

| X |

per each subset i until

K_{I D}

is obtained. Consequently, the

FE . Rep

algorithm in Reference [17] must compute

d \cdot (| H | + | X |)

operations as a worst case scenario. Since [18] also requires solving a

(τ, m)

-secret sharing scheme, the

FE . Rep

algorithm chooses a set of

τ

shares among m unlocked values and then solves a secret-sharing scheme, leading to performing additional

\frac{d}{m} ((\binom{m}{τ}) \cdot τ m (m - 1)) | X |

operations. In contrast, the FVS scheme needs to run the

FVS . Sign

algorithm to generate a signature

σ_{I D}

by taking

(b i o^{'}, SP, R)

. This is the step for [A2] in Figure 2. In our FVS scheme, the

FVS . Sign

algorithm needs

(n + 4)

exponentiations in

G_{1}

for the dimension n of biometric data. Compared to Reference [20], the signing operation is reduced by about half, which is shown in Table 3.

In order to measure the actual amount of calculation, we considered how much computation the user should perform by substituting the numbers in Table 2 directly. For instance, let

n = 512

and

ℓ = 80

. The fuzzy extractor by Canetti et al. [17] must complete

(61.6 \times 10^{5}) (| H | + | X |)

operations, and when setting

(τ, m) = (5, 32)

, the fuzzy extractor by Cheon et al. [18] needs to compute approximately

(53.6 \times 10^{3}) | H | + (16.7 \times 10^{11}) | X |

operations—which still seems to be a burdensome amount of calculation to perform on a personal device. To compare, when considering

0.3

ms for an exponentiation in

G_{1}

[34], the

FVS . Sign

algorithm in ours takes about

(512 + 4) \cdot 0.3 \approx 155

ms. Since

δ = 1 / 2

, the user is required to run the step [A2] of the authentication protocols twice, such that the

FVS . Sign

algorithm, which is computed by the user, takes about 310 ms.

5.3. Implementation

We implemented our fuzzy vector signature as a C program to measure actual time consumption. All implementations are performed on an Intel Core i7-8700k with 8GB RAM running Ubuntu 18.04 LTS, and GNU GCC version 7.5.0 is used for the compilation. For implementation, we selected the BLS12381 curve that offer around a 128-bit security level and a SHA-256 for a hash function. In a BLS12381 curve, the size of a element of

G_{1}

and

G_{2}

are 192 bit and 384 bit, respectively. Our implementation codes are available to https://github.com/Ilhwan123/FVS.

For each error rate 5% and 10%, we measured the time consumption required for running our scheme several times. Table 4 presents parameter setting, storage size, and time required for each algorithm, which is the results of our implementation. Compared with Table 2, the size of the

G_{1}

group element is 192 bits, which is less than 256 bits. Therefore, the size of a signing parameter is smaller. However, this depends on the curve type, so it can be different depending on which curve is selected.

Note that, if signature verification fails in the parameter setting in Table 4, the total verification time takes about 56 s.

6. Conclusions

In this paper, we presented an FVS scheme improved upon across all aspects of efficiency and security that is, more strictly speaking, based on the subset-based sampling method [17]. Compared to the original FVS scheme [20], we reduced the size of the signing parameter and the verification key to approximately two-thirds their original sizes and cut the signature size by about half. In addition, we reduced the number of pairings necessary for signature verification to about two-thirds the original number.

We prove that our FVS scheme is VK-private and SIG-private, meaning that the verification key and signatures generated using user correlated fuzzy (biometric) data do not reveal any information about the fuzzy input data. Additionally, instead of the unforgeability of Reference [20], we define the reusuability property which guarantees that a user is able to reuse their fuzzy correlated (biometric) data to generate polynomially-many verification keys, all while still making it infeasible for an adversary to forge a signature without any fuzzy (biometric) data. Under the reusability notion, we can prove that our FVS scheme is reusable, assuming that our FVS scheme is {VK, SIG}-private and the DL assumption holds.

In the remote authentication protocol of our FVS scheme, a user must receive the signing parameter and transmit a signature in response to a random challenge message. The primary advantage of FVS-based (biometric) authentication is that the transmission cost, including the signing parameter and signature, is determined only by the number of dimensions with respect to the fuzzy (biometric) data, not by the number of entire subsets. Thus, unlike the authentication protocol with a fuzzy extractor, the transmission cost between the user and the authentication server becomes remarkably smaller. However, the disadvantage of our FVS-based authentication scheme is that the server is required to perform pairing operations by the number of entire subsets, which is the worst case scenario. Such a burden may be somewhat alleviated by utilizing the computing power of the server in parallel, but it could be more desirable to build a new FVS scheme that supports efficient batch verification operations in the future.

Author Contributions

Conceptualization, J.H.P.; Formal analysis, I.L., and D.H.L.; methodology, I.L., and M.S.; validation, M.S., and J.H.P.; writing—original draft preparation, I.L.; writing—review and editing, M.S., and J.H.P.; supervision, D.H.L.; project administration, D.H.L.; All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No.2016-6-00600, A Study on Functional Encryption: Construction, Security Analysis, and Implementation).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Lemma 3

Proof.

Let

(H_{1}, \dots, H_{q}) \leftarrow H^{q}

and

(U_{1}, \dots, U_{q})

be q independent and uniform distributions over V. Let

D = (H_{1}

,

H_{1} (X_{1})

, ⋯,

H_{q}

,

H_{q} (X_{q}))

and

D_{i} = (H_{1}

,

U_{1}

, ⋯,

H_{i}

,

U_{i}

,

H_{i + 1}

,

H_{i + 1} (X_{i + 1})

, ⋯,

H_{q}

,

H_{q} (X_{q}))

be two distributions for

i \in [1, q]

. By mathematical induction, we prove that D and

D_{i}

are

2 ϵ i

-close for

i \in [1, q]

.

For

i = 1

, Lemma 2 shows that with a probability of at least

1 - ϵ

over the sample of

(y_{2}, \dots, y_{q}) \leftarrow (H_{2} (X_{2}), \dots, H_{q} (X_{q}))

, it holds that

\begin{matrix} H_{\infty} [X_{1} | H_{2}, \dots, H_{q}, {H_{i} (X_{i}) = y_{i}}_{i = 2}^{q}] \\ \geq H_{\infty} [X_{1} | H_{2}, \dots, H_{q}] - (q - 1) log | V | - log (1 / ϵ) \\ = H_{\infty} [X_{1}] - (q - 1) log | V | - log (1 / ϵ) \\ \geq α - (q - 1) log | V | - log (1 / ϵ) \\ = log | V | + 2 log (1 / ϵ) + Θ (1) . \end{matrix}

In this case, the leftover hash lemma [35] implies that the two distributions

D

and

D_{1}

are

2 ϵ

-close.

Next, assuming that the above lemma holds for

i - 1 < q

, we show that the case for i also holds. Lemma 2 shows that, with a probability of at least

1 - ϵ

over the sample of

(y_{i + 1}, \dots, y_{q})

←

(H_{i + 1} (X_{i + 1}), \dots, H_{q} (X_{q}))

, it holds that

\begin{matrix} H_{\infty} [X_{i} | H_{i + 1}, \dots, H_{q}, {H_{j} (X_{j}) = y_{j}}_{j = i + 1}^{q}] \\ \geq H_{\infty} [X_{i} | H_{i + 1}, \dots, H_{q}] - (q - i) log | V | - log (1 / ϵ) \\ = H_{\infty} [X_{i}] - (q - i) log | V | - log (1 / ϵ) \\ \geq α - (q - i) log | V | - log (1 / ϵ) \\ = i log | V | + 2 log (1 / ϵ) + Θ (1) \\ \geq log | V | + 2 log (1 / ϵ) + Θ (1) . \end{matrix}

Similarly, the leftover hash lemma [35] shows that the two distributions

D_{i - 1}

and

D_{i}

are

2 ϵ

-close. It follows that

Δ (D, D_{i}) \leq Δ (D, D_{i - 1}) + Δ (D_{i - 1}, D_{i}) \leq 2 ϵ (i - 1) + 2 ϵ = 2 ϵ i

, which concludes the proof of Lemma 3. □

References

Bruce, V.; Young, A. Understanding face recognition. Br. J. Psychol. 1986, 77, 305–327. [Google Scholar] [CrossRef] [PubMed]
Daugman, J. How iris recognition works. IEEE Trans. Circuits Syst. Video Technol. 2004, 14, 21–30. [Google Scholar] [CrossRef]
Ding, Y.; Zhuang, D.; Wang, K. A study of hand vein recognition method. In Proceedings of the IEEE International Conference Mechatronics and Automation; 2005; Volume 4, pp. 2106–2110. [Google Scholar] [CrossRef]
Jain, A.; Ross, A.; Prabhakar, S. An Introduction to Biometric Recognition. IEEE Trans. Circuits Syst. Video Technol. 2004, 14, 4–20. [Google Scholar] [CrossRef] [Green Version]
Maltoni, D.; Maio, D.; Jain, A.K.; Prabhakar, S. Handbook of Fingerprint Recognition; Springer: London, UK, 2009; ISBN 978-1-84882-254-2. [Google Scholar]
Dodis, Y.; Reyzin, L.; Smith, A. Fuzzy extractors: How to generate strong keys from biometrics and other noisy data. In Proceedings of the Advances in Cryptology (EUROCRYPT 2004), Interlaken, Switzerland, 2–6 May 2004; pp. 523–540. [Google Scholar] [CrossRef] [Green Version]
Takahashi, K.; Matsuda, T.; Murakami, T.; Hanaoka, G.; Nishigaki, M. A signature scheme with a fuzzy private key. In Proceedings of the 13th International Conference on Applied Cryptography and Network Security, New York, NY, USA, 2–5 June 2015; pp. 105–126. [Google Scholar] [CrossRef]
Boyen, X. Reusable Cryptographic Fuzzy Extractors. In Proceedings of the 11th ACM Conference on Computer and Communications Security, Washington, DC, USA, 25–29 October 2004; pp. 82–91. [Google Scholar] [CrossRef] [Green Version]
Daugman, J. Probing the Uniqueness and Randomness of IrisCodes: Results From 200 Billion Iris Pair Comparisons. Proc. IEEE 2006, 94, 1927–1935. [Google Scholar] [CrossRef]
Desoky, A.I.; Ali, H.A.; Abdel-Hamid, N.B. Enhancing iris recognition system performance using templates fusion. Ain Shams Eng. J. 2012, 3, 133–140. [Google Scholar] [CrossRef] [Green Version]
Hollingsworth, K.P.; Bowyer, K.W.; Flynn, P.J. Improved Iris Recognition through Fusion of Hamming Distance and Fragile Bit Distance. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 2465–2476. [Google Scholar] [CrossRef] [PubMed]
Alamélou, Q.; Berthier, P.E.; Cachet, C.; Cauchie, S.; Fuller, B.; Gaborit, P.; Simhadri, S. Pseudoentropic Isometries: A New Framework for Fuzzy Extractor Reusability. In Proceedings of the 2018 on Asia Conference on Computer and Communications Security, Incheon, Korea, 4–8 June 2018; pp. 673–684. [Google Scholar] [CrossRef]
Wen, Y.; Liu, S. Reusable Fuzzy Extractor from LWE. In Proceedings of the Australasian Conference on Information Security and Privacy 2018, Wollongong, Australia, 11–13 July 2018; pp. 13–27. [Google Scholar] [CrossRef]
Wen, Y.; Liu, S.; Han, S. Reusable Fuzzy Extractor from the Decisional Diffie—Hellman Assumption. Des. Codes Cryptogr. 2018, 86, 2495–2512. [Google Scholar] [CrossRef]
Wen, Y.; Liu, S. Robustly Reusable Fuzzy Extractor from Standard Assumptions. In Proceedings of the Advances in Cryptology (ASIACRYPT 20180, Brisbane, Australia, 2–6 December 2018; pp. 459–489. [Google Scholar] [CrossRef]
Apon, D.; Cho, C.; Eldefrawy, K.; Katz, J. Efficient, Reusable Fuzzy Extractors from LWE. In Proceedings of the International Conference on Cyber Security Cryptography and Machine Learning, Be’er Sheva, Israel, 29–30 June 2017; pp. 1–18. [Google Scholar] [CrossRef]
Canetti, R.; Fuller, B.; Paneth, O.; Reyzin, L.; Smith, A. Reusable fuzzy extractors for low-entropy distributions. In Proceedings of the Advances in Cryptology (EUROCRYPT 2016), Vienna, Austria, 8–12 May 2016; pp. 117–146. [Google Scholar] [CrossRef]
Cheon, J.; Jeong, J.; Kim, D.; Lee, J. A Reusable Fuzzy Extractor with Practical Storage Size: Modifying Canetti et al.’s Construction. In Proceedings of the Australasian Conference on Information Security and Privacy 2018, Wollongong, Australia, 11–13 July 2018; pp. 28–44. [Google Scholar] [CrossRef]
Tian, Y.; Li, Y.; Deng, R.H.; Sengupta, B.; Yang, G. Lattice-Based Remote User Authentication from Reusable Fuzzy Signature. Cryptology ePrint Archive: Report 2019/743. Available online: https://eprint.iacr.org/2019/743 (accessed on 24 June 2019).
Seo, M.; Hwang, J.Y.; Lee, D.H.; Kim, S.; Kim, S.; Park, J.H. Fuzzy Vector Signature and Its Application to Privacy-Preserving Authentication. IEEE Access 2019, 7, 69892–69906. [Google Scholar] [CrossRef]
Boneh, D.; Raghunathan, A.; Segev, G. Function-Private Identity-Based Encryption: Hiding the Function in Functional Encryption. In Proceedings of the Advances in Cryptology (CRYPTO 2013), Santa Barbara, CA, USA, 18–22 August 2013; pp. 461–478. [Google Scholar] [CrossRef] [Green Version]
Regev, O. On Lattices, Learning with Errors, Random Linear Codes, and Cryptography. In Proceedings of the Thirty-Seventh Annual ACM Symposium on Theory of Computing (STOC 2005), Baltimore, MA, USA, 22–24 May 2005; pp. 84–93. [Google Scholar] [CrossRef]
Fuller, B.; Meng, X.; Reyzin, L. Computational Fuzzy Extractors. In Proceedings of the advances in Cryptology (ASIACRYPT 2013), Bangalore, India, 1–5 December 2013; pp. 174–193. [Google Scholar] [CrossRef] [Green Version]
Matsuda, T.; Takahashi, K.; Murakami, T.; Hanaoka, G. Fuzzy Signatures: Relaxing Requirements and a New Construction. In Proceedings of the Applied Cryptography and Network Security (ACNS 2016), London, UK, 19–22 June 2016; pp. 97–116. [Google Scholar] [CrossRef]
Yasuda, M.; Shimoyama, T.; Takenaka, M.; Abe, N.; Yamada, S.; Yamaguchi, J. Recovering Attacks Against Linear Sketch in Fuzzy Signature Schemes of ACNS 2015 and 2016. In Proceedings of the Information Security Practice and Experience, Melbourne, Australia, 13–15 December 2017; pp. 409–421. [Google Scholar] [CrossRef]
Takahashi, K.; Matsuda, T.; Murakami, T.; Hanaoka, G.; Nishigaki, M. Signature schemes with a fuzzy private key. In Int. J. Inf. Secur. 2019, 18, 581–617. [Google Scholar] [CrossRef] [Green Version]
Vadhan, S.P. Pseudorandomness. Found. Trends Theor. Comput. Sci. 2012, 7, 1–336. [Google Scholar] [CrossRef]
Bernhard, D.; Pereira, O.; Warinschi, B. How Not to Prove Yourself: Pitfalls of the Fiat-Shamir Heuristic and Applications to Helios. In Proceedings of the Advances in Cryptology (ASIACRYPT 2012), Beijing, China, 2–6 December 2012; pp. 626–643. [Google Scholar] [CrossRef] [Green Version]
Kiltz, E.; Masny, D.; Pan, J. Optimal security proofs for signatures from identification schemes. In Proceedings of the Advances in Cryptology (CRYPTO 2016), Santa Barbara, CA, USA, 14–18 August 2016; pp. 33–61. [Google Scholar] [CrossRef]
Boneh, D.; Boyen, X. Short Signatures Without Random Oracles. In Proceedings of the Advances in Cryptology (EUROCRYPT 2004), Interlaken, Switzerland, 2–6 May 2004; pp. 56–73. [Google Scholar] [CrossRef] [Green Version]
Boneh, D.; Shen, E.; Waters, B. Strongly Unforgeable Signatures Based on Computational Diffie-Hellman. In Proceedings of the Public Key Cryptography (PKC 2006), New York, NY, USA, 24–26 April 2006; pp. 229–240. [Google Scholar] [CrossRef] [Green Version]
Kurihara, J.; Kiyomoto, S.; Fukushima, K.; Tanaka, T. A New (k,n)-Threshold Secret Sharing Scheme and Its Extension. In Proceedings of the Information Security 2008, Taipei, Taiwan, 15–18 September 2008; pp. 455–470. [Google Scholar] [CrossRef] [Green Version]
ISO/IEC 15946-5:2017. Information Technology-Security Techniques-Cryptographic Techniques Based on Elliptic Curves—Part 5: Elliptic Curve Generation; International Organization for Standardization: Geneva, Switzerland, 2017. [Google Scholar]
Bos, J.W.; Costello, C.; Naehrig, M. Exponentiating in Pairing Groups. In Proceedings of the Selected Areas in Cryptography (SAC 2013), Coimbra, Portugal, 18–22 March 2013; pp. 438–455. [Google Scholar] [CrossRef] [Green Version]
HÅstad, J.; Impagliazzo, R.; Levin, L.A.; Luby, M. A Pseudorandom Generator from Any One-Way Function. SIAM J. Comput. 1999, 28, 1364–1396. [Google Scholar] [CrossRef]

Figure 1. Authentication with a fuzzy extractor.

Figure 2. Authentication with our fuzzy vector signature (FVS).

Figure 3. Comparison between previous and our construction.

Table 1. Comparison with reusable biometric cryptosystems.

	Schemes	Secure Sketch	High Min-Entropy	Security Assumption	Error Tolerance Rate	Reusability	Source Distribution
	[8]	O	Y	+	linear	weak	$w_{i} = w + δ_{i}$
	[16]	X	N	LWE	log	Strong	$w_{i} = w + δ_{i}$
	[13]	O	Y	LWE	linear	Strong	$w_{i} = w + δ_{i}$
FE	[15]	O	Y	DDH	linear	Strong	$w_{i} = w + δ_{i}$
	[14]	O	Y	DDH	linear	Strong	$H_{\infty} [w_{i} \| w_{i} - w_{j}] > m$
	[17]	X	N	X	sub-lin	Strong	$(w, w_{i})$
	[18]	X	N	X	sub-lin	Strong	$(w, w_{i})$
	[12]	X	Y	X	linear	Strong	$(w, w_{i})$
FS	[19]	X	N	LWE	log	Strong	$w_{i} = w + δ_{i}$
FVS	[20]	X	N	XDH	sub-lin	∆	$(T, k)$
FVS	Ours	X	N	XDH	sub-lin	Strong	$(w, w_{i})$

• In Secure Sketch, If the scheme used the secure sketch, “O”. Otherwise, “X”. • In Reusability, “weak” means that the scheme is proven in the weak reusability model; • “∆” means that the scheme does not provide a formal proof; • In Source Distribution, “

w_{i} = w + δ_{i}

” means that, for a fuzzy (biometric) source w, the error

δ_{i}

is controlled by an adversary; • “

(w, w_{i})

” means that the biomteric readings w and

w_{i}

are arbitrary correlated; • “

(T, k)

” means the

(T, k)

-block source in Reference [21]; • High Min-entropy means that min-entropy is higher than some value secure against brute-force attack. In High Min-entropy, “Y” is indicated if the scheme requires sufficiently high min-entropy of the input (biometric) data, and “N“ otherwise; • In Security Assumption, “+” means that the scheme is information-theoretically secure.

Table 2. Comparison of storage or transmission costs with helper data.

	n	Error Tolerance Rate $(t / n)$	# of Components In Each Subset $(ℓ)$	# of Subsets $(d)$	Helper Data ( $P_{ID}, SP$ )
[17]	512		80	$61.6 \times 10^{5}$	0.83 GB
	1024	20%	80	$61.6 \times 10^{5}$	0.90 GB
	2048		80	$61.6 \times 10^{5}$	0.96 GB
[18]	512		16 $(\times 5)$	$53.6 \times 10^{3}$	3.76 MB
	1024	20%	20 $(\times 4)$	$26.3 \times 10^{3}$	2.01 MB
	2048		27 $(\times 3)$	$11.6 \times 10^{4}$	10.16 MB
[20]	512		80	$61.6 \times 10^{5}$	0.05 MB
	1024	20%	80	$61.6 \times 10^{5}$	0.09 MB
	2048		80	$61.6 \times 10^{5}$	0.19 MB
Ours	512		80	$61.6 \times 10^{5}$	0.03 MB
	1024	20%	80	$61.6 \times 10^{5}$	0.06 MB
	2048		80	$61.6 \times 10^{5}$	0.13 MB

Table 3. Comparison of computational costs necessary for signature generation.

	Operation
[17]	$d \cdot (\| H \| + \| X \|)$ + Sign
[18]	$\frac{d}{m} ((\binom{m}{τ}) \cdot τ m (m - 1)) \| X \|$ + $d \cdot (\| H \| + \| X \|)$ + Sign
[20]	$(2 n + 2) \| E \|$
Ours	$(n + 4) \| E \|$

∘

| H |

: Hash,

| X |

: XOR,

| E |

: Exponentiation.

Table 4. Implementation results of FVS using BLS12381 curve in case of error tolerance rate

(t / n)

: 12.5%, the number of components in each subset

(ℓ)

: 80, and the number of subsets

(d)

: 15,268 with

δ = 1 / 2

.

Table 4. Implementation results of FVS using BLS12381 curve in case of error tolerance rate

(t / n)

: 12.5%, the number of components in each subset

(ℓ)

: 80, and the number of subsets

(d)

: 15,268 with

δ = 1 / 2

.

n	Signing Parameter	Verification Key	Signature	Setup (s)	Sign (ms)	Verify (s)	Error Rate ${bio}^{'}$
512	24.04 KB	1.95 MB	12.10 KB	23.97	133.86	0.302	5%
512	24.04 KB	1.95 MB	12.10 KB	23.97	133.86	17.954	10%
1024	48.04 KB	1.95 MB	24.11 KB	24.47	264.88	0.254	5%
1024	48.04 KB	1.95 MB	24.11 KB	24.47	264.88	21.830	10%
2048	96.04 KB	1.95 MB	48.10 KB	25.92	532.01	0.377	5%
2048	96.04 KB	1.95 MB	48.10 KB	25.92	532.01	21.364	10%

• Signing Parameter, Verification Key, and Signature means the size of them, respectively, for each fuzzy (biometric) data length n. •Setup, Sign, and Verify means the average time required for each algorithm. • Error rate

{b i o}^{'}

means a percent of difference between a input data of Setup

b i o

and a input data of Sign

{b i o}^{'}

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lim, I.; Seo, M.; Lee, D.H.; Park, J.H. An Improved Fuzzy Vector Signature with Reusability. Appl. Sci. 2020, 10, 7141. https://doi.org/10.3390/app10207141

AMA Style

Lim I, Seo M, Lee DH, Park JH. An Improved Fuzzy Vector Signature with Reusability. Applied Sciences. 2020; 10(20):7141. https://doi.org/10.3390/app10207141

Chicago/Turabian Style

Lim, Ilhwan, Minhye Seo, Dong Hoon Lee, and Jong Hwan Park. 2020. "An Improved Fuzzy Vector Signature with Reusability" Applied Sciences 10, no. 20: 7141. https://doi.org/10.3390/app10207141

APA Style

Lim, I., Seo, M., Lee, D. H., & Park, J. H. (2020). An Improved Fuzzy Vector Signature with Reusability. Applied Sciences, 10(20), 7141. https://doi.org/10.3390/app10207141

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Improved Fuzzy Vector Signature with Reusability

Abstract

1. Introduction

1.1. Related Work

1.2. Source Distributions

1.3. Contribution

2. Preliminaries

2.1. Notation

2.2. Hamming Distance Metric

2.3. Min-Entropy

2.4. Statistical Distance

2.5. Universal Hash Function

2.6. Discrete Logarithm Assumption

2.7. Bilinear Maps

2.8. External Diffie-Hellman (XDH) Assumption

3. Definitions

3.1. Syntax of Fuzzy Vector Signature

3.2. Security Models

3.2.1. VK-Privacy

3.2.2. SIG-Privacy

3.2.3. Reusability

4. Fuzzy Vector Signature

4.1. Construction

4.2. Setting the Number of Subsets

4.3. Correctness

4.4. Security

5. Performance Analysis

5.1. Storage or Transmission Costs

5.2. Computation Cost

5.3. Implementation

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Proof of Lemma 3

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI