1. Introduction
Cryptographic algorithms rarely fail in isolation; real-world security breaches are far more often caused by the difficulty humans face in generating, storing, and reliably recalling high-entropy secrets [
1]. This constraint has driven two decades of research into mnemonic key storage: password managers, brainwallets, and standardized phrase lists such as BIP-39 each represent a different point in the trade-off space between entropy and memorability.
Brainwallets and passphrase-derived keys attempt to eliminate the need for external storage by relying entirely on human memory. However, extensive empirical research has shown that such approaches fail under realistic offline attacker models due to severe entropy collapse caused by predictable human choice [
2,
3]. Large-scale studies of real-world password datasets demonstrate that users consistently select secrets with strong linguistic and cultural bias, resulting in effective entropy levels far below theoretical estimates [
2,
4]. Analyses of funded brainwallets further confirm that passphrase-derived keys are often compromised shortly after use, sometimes within minutes [
3,
5]. Even mnemonic-friendly strategies fail to provide durable protection once human selection bias is taken into account.
In contrast, machine-generated mnemonic schemes such as BIP-39 offer strong cryptographic guarantees by selecting words uniformly from a fixed dictionary, yielding well-defined entropy budgets. A standard 12-word BIP-39 mnemonic provides 128 bits of nominal entropy, making exhaustive brute-force attacks computationally infeasible under conventional assumptions. However, these schemes impose a significant cognitive burden on users, who must reliably store or back up the mnemonic phrase. In practice, this often leads to insecure storage practices or irreversible loss of access, effectively shifting the problem from entropy generation to key retention and recovery.
Despite their differences, password-based and brainwallet approaches share a common dependency: linguistic or abstract symbolic memory, a form of recall that degrades measurably over months and years [
6,
7]. Spatial memory follows a strikingly different retention profile. Layouts of familiar places, routes, and personal landmarks are encoded through hippocampal circuits that govern long-term spatial navigation and are retained with high fidelity even after long intervals without rehearsal [
8,
9,
10,
11]. The difference is not incidental: it reflects distinct cognitive mechanisms, with verbal recall rapidly susceptible to interference and spatial recall comparatively stable.
Prior authentication research has drawn on related intuitions, exploring graphical and map-based schemes [
12,
13,
14,
15]. User studies confirm improved recall, but the security question has been handled qualitatively: none of these systems couples spatial selection to a formal entropy model, quantifies the reduction caused by demographic selection bias, or evaluates the resulting work factor against a GPU-class offline adversary with access to attacker-prioritized spatial dictionaries.
In parallel, advances in cryptographic key derivation functions have made it possible to impose strong, hardware-enforced asymmetries between legitimate users and offline adversaries. Memory-hard functions such as Argon2 were explicitly designed to resist massively parallel GPU and ASIC attacks by binding performance to memory bandwidth and capacity rather than raw compute throughput [
16,
17]. Empirical studies have shown that such functions significantly reduce attacker advantage and allow defenders to trade modest increases in computation time for large increases in brute-force resistance [
18].
This paper introduces GeoVault, a key management framework that combines these insights. GeoVault derives cryptographic keys from one or more user-selected spatial locations, encoded deterministically using geospatial encoding schemes and hardened through memory-hard key derivation. By anchoring secrets in spatial memory and amplifying their security through computational hardness, GeoVault aims to mitigate the security–usability limitations inherent in brainwallet-style systems under offline attack models.
The main contributions of this work are as follows:
We introduce a formal entropy model for spatially anchored secrets that distinguishes nominal spatial entropy from effective entropy under attacker-prioritized spatial dictionaries, capturing realistic semantic and clustering biases in human location choice.
We analyze and quantify the security limits of spatial memory under realistic human selection behavior, showing how clustering and semantic constraints reduce effective entropy and how multi-point spatial selection mitigates this effect.
We empirically evaluate the defender–attacker asymmetry induced by memory-hard key derivation using Argon2id, demonstrating that modest multi-point spatial selection combined with memory-hard derivation can achieve attacker-adjusted work factors that approach or, under certain configurations, exceed those of 12-word BIP-39 mnemonics, while single-point configurations provide meaningful offline resistance with reduced cognitive burden. We further characterize the effect of KDF chaining depth k as an independent security axis, provide projected latency estimates across server, consumer laptop and smartphone hardware classes, and analyze GeoVault’s security margins under quantum (Grover) and ASIC adversary models.
The paper is structured as follows.
Section 2 reviews related work on brainwallets, mnemonic-based key storage, spatial cognition, and geospatial encoding.
Section 3 develops the theoretical foundations, including formal entropy models, attacker-prioritized spatial dictionaries, and computational hardening.
Section 4 presents the
GeoVault protocol design.
Section 5 reports the empirical benchmarks.
Section 6 discusses implications and limitations.
Section 7 concludes.
2. Related Work
2.1. Brainwallets and Their Weaknesses
Brainwallets represent a class of deterministic cryptocurrency wallets in which private keys are derived directly from user-chosen passwords or passphrases. The underlying assumption is that users can reliably memorize sufficiently complex secrets, thereby eliminating the need for physical or digital storage of private keys [
3,
19]. In practice, however, extensive empirical research has demonstrated that this assumption is rarely satisfied, leading to severe security and usability limitations.
The core vulnerability of brainwallets stems from the predictable nature of human-chosen passwords and passphrases. Vasek et al. [
3] conducted a large-scale empirical analysis of funded brainwallets and showed that users overwhelmingly select easily guessable phrases, resulting in a dramatic reduction in effective entropy. Their findings revealed that the majority of brainwallets were compromised shortly after being funded, often within minutes, underscoring the practicality of offline dictionary attacks against passphrase-derived keys.
Earlier work by Kuo et al. [
4] identified similar weaknesses in mnemonic phrase-based passwords, demonstrating that users frequently rely on common phrases sourced from online material. Yang et al. [
5] further confirmed that mnemonic-based password strategies, despite their perceived memorability benefits, produce secrets that remain highly vulnerable to statistical guessing attacks.
These vulnerabilities are compounded by fundamental cognitive limitations in human memory. Bonneau [
2] demonstrated that, despite repeated efforts to promote stronger password practices, most users fail to achieve sufficient entropy in practice. Adams and Sasse [
1,
20] explained this phenomenon from a usability perspective, arguing that increasing password complexity requirements directly conflicts with human memory constraints and predictably leads users toward insecure coping strategies.
Attempts to mitigate these weaknesses through user education have shown limited effectiveness. Kävrestad and Nohlberg [
21] found that context-based security training failed to produce meaningful improvements in real-world passphrase strength, highlighting the persistent tension between security requirements and human cognitive limitations.
Alternative authentication schemes that seek to exploit human visual memory have similarly encountered structural weaknesses. Tari et al. [
12] and Golla et al. [
13] investigated graphical and emoji-based password systems, identifying vulnerabilities such as susceptibility to shoulder-surfing and strong user bias toward predictable choices. While such approaches can improve memorability, they do not eliminate the fundamental problem of low effective entropy under targeted attack models.
To address the inherent weaknesses of brainwallets, alternative key management approaches have been proposed, including hierarchical deterministic (HD) wallets [
22,
23] and hardware-based wallets, commonly referred to as cold wallets [
19,
24]. HD wallets, standardized in BIP32, derive large key hierarchies from a single master seed, simplifying backup and recovery. However, Di Luzio et al. [
22] identified privilege escalation vulnerabilities in multi-user settings, while Gutoski and Stebila [
23] demonstrated that partial key leakage can enable recovery of the master secret, exposing the entire wallet hierarchy.
Cold wallets mitigate network-based attacks by storing private keys offline, but they introduce distinct attack surfaces. Das et al. [
19] analyzed the hot/cold wallet paradigm and formalized conditions under which such systems remain secure. Nevertheless, Guri [
24] experimentally demonstrated that air-gapped wallets can be compromised through physical side channels, including electromagnetic, acoustic, optical, and thermal leakage, challenging the assumption that hardware isolation alone provides absolute security.
In summary, brainwallets and mnemonic-based password schemes consistently suffer from two fundamental weaknesses: low effective entropy caused by predictable human-chosen secrets, and usability constraints rooted in cognitive limitations [
3,
4,
5]. While HD wallets and cold wallets mitigate certain risks, they introduce additional complexity and new failure modes, such as master-key leakage [
22,
23] or physical side-channel exfiltration [
19,
24]. These limitations motivate the exploration of alternative approaches that align more closely with human cognitive strengths. In particular, leveraging spatial memory—an ability known for its robustness and long-term retention—offers a promising foundation for secure and user-friendly key management.
2.2. Human Spatial Memory as a Cryptographic Asset
Human memory is notoriously unreliable when it comes to memorizing abstract information such as complex alphanumeric strings or passphrases [
6,
7]. However, decades of cognitive psychology research have shown that humans exhibit a disproportionately strong ability to encode, recall, and retain spatial information [
8,
9]. This observation underpins the “method of loci,” an ancient memory technique that leverages imagined spatial environments to recall information by placing memories at physical locations within a mental map [
25].
Neurological evidence also supports the unique role of spatial memory in human cognition. The hippocampus, a brain structure critical for memory formation, is also central to spatial navigation [
11]. The dual-use of this brain region suggests a strong evolutionary and functional link between spatial awareness and memory encoding. Studies in cognitive neuroscience confirm that people can recall spatial configurations with high accuracy even after long periods, particularly when these memories are grounded in visual or map-based stimuli [
26,
27].
Map-based authentication has been explored since at least
PassMap [
14] and
GeoPass [
15], which showed that geographic secrets are retained for 3–5 weeks with fewer errors than alphanumeric passphrases.
GeoPassNotes [
28] further showed that personally meaningful landmarks improve recall by activating episodic memory cues.
Despite these advantages, prior work in map-based authentication has identified severe entropy collapse caused by predictable human selection. Users consistently exhibit strong semantic bias toward residential locations (home/work), major urban centers, and famous global landmarks [
14,
15,
29]. A comprehensive usability and security evaluation by Al-Ameen and Wright [
29] further quantified the degree to which user selection distributions concentrate around personally and culturally salient locations, confirming that an adversary with even limited contextual knowledge of the user can substantially reduce the effective search space below the nominal geographic entropy. These selection patterns reduce the effective search space in ways that purely geographic dictionaries do not capture. While these studies establish the memorability of spatial secrets, they have not yet been integrated into a cryptographic key derivation framework that explicitly mitigates selection bias through memory-hardening and hardware-aware offline threat modeling.
These findings suggest that spatial memory may provide a superior foundation for cryptographic key storage and retrieval mechanisms. Unlike traditional brainwallets that rely on memorized strings, a system that anchors a seed derivation process to a spatial coordinate or map location naturally exploits this cognitive strength. By combining the robustness of spatial recall with computational functions that bind access to the correct location and effort, it is possible to achieve both higher security and improved usability.
2.3. Geospatial Encoding Systems
Geospatial encoding systems provide a structured way to represent geographic locations using codes or identifiers, facilitating applications such as navigation, logistics, and, in the context of this research, secure mnemonic storage based on spatial memory. These systems translate physical locations into a format that can be easily shared and processed, making them a critical component of the proposed GeoVault protocol.
A widely recognized example is What3Words (w3w), which partitions the globe into 3 m × 3 m squares, each assigned a unique three-word identifier. This system excels in usability due to its intuitive word-based format, reducing errors compared to traditional coordinates, and supports multiple languages [
30]. However, its fixed resolution limits its flexibility for applications requiring finer granularity or three-dimensional encoding, such as indoor environments [
31]. Moreover, recent critiques highlight potential confusion between similar word triplets, posing risks in critical applications like emergency response or secure storage [
32]. These limitations are particularly relevant to
GeoVault, where precise and unambiguous location encoding directly impacts mnemonic security.
Alternative systems include Geohash, Google’s Open Location Code (OLC), and Google S2. Geohash employs a hierarchical grid with alphanumeric codes, offering variable precision but suffering from complexity and potential user confusion due to its base32 encoding [
33]. Open Location Code, designed for offline use, provides a more accessible alphanumeric approach, balancing usability and scalability [
33]. Google S2, optimized for spatial indexing, uses a cell-based structure that supports efficient querying and could enhance the protocol’s computational efficiency [
33]. Each system presents distinct characteristics: Geohash and OLC prioritize flexibility, while S2 emphasizes performance in large-scale applications.
The choice of a geospatial encoding system for
GeoVault involves several trade-offs. Usability is critical for enabling users to recall and input locations accurately, favoring systems like What3Words [
30]. Granularity, or the size of the encoded area, affects the entropy of the system—a finer grid increases the number of possible locations, strengthening security against brute-force attacks. Encoding precision also influences the integration with computation-hard derivation functions like Argon2, as higher precision demands greater computational effort to protect the mnemonic seed. Additionally, reliability and accuracy are paramount, as geocoding errors can compromise the protocol’s integrity. Studies reveal significant variability in geocoding accuracy across services and regions, necessitating robust validation frameworks [
34,
35].
In this research, geospatial encoding systems underpin the spatial anchoring of mnemonic seeds. While What3Words offers a compelling balance of usability and precision, its proprietary nature and potential for confusion [
32] necessitate the exploration of open-source alternatives like Open Location Code for fully offline-resolvable implementations. Alternatives like Google S2 could improve scalability and precision, while custom schemes might better align with the protocol’s security requirements [
33]. Furthermore, address extraction and matching techniques from web data, as explored in [
36], could enhance location selection, ensuring users choose memorable yet secure spatial anchors.
Ultimately, the selected system must integrate seamlessly with the protocol’s threat model, resisting attacks such as map scanning or brute-forcing while maintaining user-friendliness. Future work will explore hybrid approaches, potentially combining What3Words’ simplicity with S2’s flexibility, and evaluate their performance in real-world mnemonic recovery scenarios.
2.4. Encryption Techniques for Hardening Secrets
VDFs [
37,
38], time-lock puzzles [
39], and proofs of sequential work (PoSW) [
40] each provide forms of sequentially enforced delay;
GeoVault instead uses memory-hard KDFs, which enforce a hardware-bound asymmetry between defenders and offline adversaries without requiring interaction or trusted setup.
Memory-hard KDFs, such as Argon2, transform inputs like spatial coordinates into cryptographic keys through processes that require substantial memory resources, thereby resisting optimization by hardware accelerators such as ASICs or GPUs [
16,
17]. Unlike compute-bound hashes, Argon2d binds attacker performance to memory bandwidth and capacity. This ensures that even high-end GPU clusters face a non-linear reduction in throughput once the memory requirements of the KDF approach the physical VRAM limits of the device. Although emerging compute-capable memory technologies may eventually reduce access costs for older functions like scrypt [
18], Argon2id remains the current standard for mitigating both side-channel and time–memory trade-off attacks in human-centered key management.
Together, Argon2id converts modest user-side latency into a disproportionately large attacker work factor. Hybrid combinations with PoSW remain an open direction for further hardening against parallelized map-scanning attacks.
3. Theoretical Background
3.1. Entropy in Mnemonic and Password-Based Secure Storage
Cryptographic key management systems frequently derive secret keys from human-memorable inputs, including machine-generated mnemonics, user-chosen passwords, and passphrases, collectively referred to as brainwallet-based constructions. The security of such systems is commonly expressed in terms of entropy, which characterizes the size of the underlying search space available to an offline attacker. Shannon entropy [
41] provides a fundamental measure of uncertainty in this context and is defined in Equation (
1).
where
denotes the alphabet of possible secrets and
is the probability of selecting element
.
While Shannon entropy provides a useful conceptual measure, resistance to offline guessing attacks is ultimately governed by the effective size of the attacker’s prioritized search space. In practice, this corresponds to the entropy remaining once human selection bias and attacker-adaptive guessing strategies are taken into account.
In the BIP-39 standard, mnemonic phrases are generated algorithmically by selecting words uniformly and independently from a fixed dictionary. A standard 12-word BIP-39 mnemonic uses a dictionary of 2048 entries and yields exactly 128 bits of nominal entropy, corresponding to possible mnemonic combinations. Under the assumption of uniform random generation, this construction provides strong cryptographic resistance against exhaustive brute-force attacks.
To illustrate the scale implied by 128 bits of nominal entropy, consider an offline attacker equipped with modern GPU hardware optimized for hashing operations. A high-end consumer GPU can evaluate on the order of
candidate keys per second under lightweight hashing assumptions [
42,
43,
44]. Under this assumption, the expected time required to exhaustively search the full mnemonic space is given by Equation (
2).
This corresponds to approximately
years, vastly exceeding the age of the universe. Consequently, uniformly generated BIP-39 mnemonics are effectively immune to brute-force attacks under realistic computational assumptions.
In contrast, password- and passphrase-based brainwallets derive cryptographic keys directly from user-chosen secrets. From a theoretical perspective, if a password or passphrase was selected uniformly at random from a dictionary of size
N, each symbol would contribute
bits of nominal entropy. The minimum length
L required to reach a target of 128 bits of nominal entropy is given by Equation (
3).
Table 1 summarizes this relationship for several representative selection sets.
While
Table 1 describes the entropy achievable under idealized uniform selection, extensive empirical evidence shows that human-generated passwords and passphrases violate these assumptions in practice. Large-scale analyses of real-world password datasets reveal that user choice introduces significant linguistic, cultural, and structural biases, resulting in a dramatic reduction in effective entropy under realistic attacker models.
Bonneau [
2], analyzing an anonymized corpus of nearly 70 million passwords, showed that typical user-chosen passwords provide fewer than 10 bits of resistance against online guessing attacks and on the order of 20 bits against optimized offline dictionary attacks. Vasek et al. [
3] similarly observed that brainwallet passphrases are highly predictable and frequently compromised shortly after funding. Earlier studies by Kuo et al. [
4] and subsequent work by Yang et al. [
5] further corroborate that mnemonic and passphrase-based strategies, despite increased length or apparent complexity, remain vulnerable to statistical guessing due to human selection bias. Collectively, these results establish entropy collapse as a fundamental limitation of password- and passphrase-based brainwallets and motivate security models that explicitly distinguish nominal entropy from effective entropy under attacker-adaptive guessing strategies.
3.2. Entropy in Spatial Memory-Based Systems
The cryptographic strength of spatially anchored mnemonic systems, such as GeoVault, derives from the discretization of a continuous spatial domain into a finite set of selectable cells. Each cell constitutes a mnemonic element analogous to a word in a linguistic dictionary. The entropy provided by such systems is therefore determined by the resolution of the spatial discretization and by the effective size of the attacker’s prioritized spatial search space.
3.2.1. Nominal Spatial Entropy
Let
denote a spatial domain with total surface area
, discretized into cells of area
. Under uniform random selection, the nominal entropy of a single spatial choice is defined in Equation (
4).
For the Earth’s surface (
) and a discretization with
, the nominal entropy of a single spatial cell is given by Equation (
5).
Achieving a target of 128 bits of entropy via a single spatial cell would require an unrealistically small cell area, as shown in Equation (
6).
which is infeasible in practice. Consequently, spatial systems must rely on either multiple spatial selections or computational entropy-boosting mechanisms.
3.2.2. Effective Entropy and Spatial Dictionaries
Nominal spatial entropy assumes uniform random selection across the spatial domain
. However, extensive research in spatial cognition shows that human spatial memory is hierarchically structured around landmarks, reference frames, and semantically meaningful regions, rather than represented as a uniform metric space [
8,
10]. As a result, realistic offline attackers can prioritize large portions of the search space by exploiting predictable human spatial behavior.
To model such attacks, we introduce the concept of spatial dictionaries, defined as attacker-prioritized subsets that reflect semantic, geographic, or contextual constraints on likely user choices. This abstraction captures coarse-grained attacker strategies, such as restricting guesses to landmasses, urban areas, coastlines, or other salient regions, without assuming knowledge of user-specific preferences or fine-grained popularity rankings.
Let
denote the total area covered by a spatial dictionary
. The effective entropy of a single spatial selection under this attacker prior is given by Equation (
7).
The corresponding entropy collapse relative to the global nominal case is defined in Equation (
8).
Equation (
7) models the attacker’s geographic prior—which dictionary stratum is targeted—but treats selection within the targeted stratum as uniform. Empirical studies of geographic authentication systems demonstrate that this uniformity assumption overstates effective entropy: users disproportionately choose home addresses, workplaces, frequently visited venues, and culturally salient landmarks, concentrating realistic choices over a sub-fraction of the dictionary’s nominal coverage area [
14,
15,
29]. To incorporate this within-dictionary behavioral bias, we introduce a selection fraction
, representing the proportion of cells in
that attract non-negligible probability mass under a demographically informed prior. The corrected effective entropy is
Setting
recovers the uniform selection model. Because the empirical literature does not yet provide a universal estimate of
applicable across all geographies and user populations, all security evaluations in subsequent sections treat
as an upper bound on true effective entropy: actual effective entropy is lower whenever
, i.e., whenever user selection within the targeted stratum is non-uniform. All security zone classifications in
Section 5 are therefore optimistic for the defender and should be interpreted as best-case estimates pending empirical calibration of
. The targeted attacker scenario is addressed by selecting the appropriate fine-grained dictionary (e.g., the urban-scale stratum with
bits), which already encodes significant behavioral prior knowledge;
provides the formal mechanism for analysts to apply further within-stratum corrections as empirical data become available.
3.2.3. Clustering Bias and Localized Collapse
A particularly severe form of entropy collapse arises when users select multiple locations within a familiar local context. If an initial anchor point
is selected freely, but subsequent points
are constrained to lie within a radius
r of
, the entropy contribution of each additional point is bounded by the area of the surrounding neighborhood. Under this proximity constraint, the entropy contribution per additional point is given by Equation (
10).
The total effective entropy for
n spatial selections under this proximity-constrained model can then be approximated as shown in Equation (
11).
The term
denotes the effective entropy of the first (anchor) point and is bounded above by the entropy of the spatial dictionary
from which the user selects, i.e.,
. Under a geographically uninformed attacker prior,
bits. However, as documented in geographic password research [
14,
15,
29], a demographically informed attacker who knows the user’s city of residence may restrict the search to an urban-scale dictionary, reducing
to ≈27.4 bits. This anchor entropy collapse propagates into all subsequent
n-point entropy estimates: the total loss relative to the uninformed model is
regardless of
n, making the anchor the dominant security-relevant factor against a targeted adversary.
For clustering at the city scale (e.g., ), this model shows that even multiple spatial selections fail to approach a 128-bit entropy target under realistic attacker assumptions. This observation demonstrates that spatial entropy alone is insufficient to guarantee cryptographic-strength security in the presence of predictable clustering behavior and motivates the integration of computational entropy-boosting mechanisms.
3.3. Entropy Boosting Techniques
Given the practical limits of spatial entropy, spatially anchored mnemonic systems such as GeoVault rely on entropy-boosting techniques to increase resistance against offline brute-force attacks. Rather than increasing the number or resolution of spatial elements, these techniques amplify security by increasing the computational cost of each guess.
Key derivation functions (KDFs) transform low-entropy or biased inputs into cryptographically strong keys by imposing configurable time and memory costs per evaluation. Memory-hard KDFs such as Argon2 are particularly effective in this role, as they bind attacker performance to memory bandwidth and capacity rather than raw parallel compute throughput, thereby reducing the advantage of GPUs and ASICs. When combined with spatial mnemonics, memory-hard KDFs allow systems to achieve high attacker-adjusted work factors despite predictable human selection behavior.
4. Materials and Methods
4.1. Protocol Design
The GeoVault protocol derives cryptographic keys from one or more user-chosen spatial locations by combining deterministic geospatial encoding with memory-hard key derivation. The design goal is to anchor secrets in human spatial memory while ensuring that key derivation remains fully reproducible and verifiable in an offline setting.
At a high level, the protocol consists of four conceptual stages: spatial selection, geospatial encoding, entropy boosting, and key extraction. First, the user selects a finite set of spatial points within a predefined spatial domain. These selections are made deliberately and are assumed to be recallable by the user without external storage. Second, each selected point is deterministically mapped to a discrete spatial cell using a geospatial encoding scheme, transforming continuous coordinates into a finite and reproducible representation that can be evaluated offline.
Third, the encoded spatial identifiers are combined and processed using a memory-hard key derivation function. This entropy-boosting stage is implemented as a single invocation of a memory-hard KDF with configurable time and memory parameters. It does not introduce additional information, but instead increases the computational and memory cost associated with evaluating each candidate secret, thereby reducing the feasibility of large-scale offline brute-force attacks, particularly on GPU-class hardware.
Finally, the output of the key derivation function is used directly as a cryptographic key or as input to a higher-level key management mechanism, depending on the application context.
The protocol makes no assumptions about secrecy of the geospatial encoding scheme or obscurity of the spatial discretization. All security derives from the unpredictability of the user’s spatial choices and the computational hardness imposed by the key derivation function. By separating human-memorable input selection from computational hardening, GeoVault allows the security–usability trade-off to be adjusted through two independent parameters: the number of spatial selections and the cost parameters of the key derivation function.
4.2. Spatial Selection
GeoVault assumes that a user selects a finite set of geographic points. Selection is performed by clicking or tapping a location on a digital map interface; no GPS or device location service is involved. The geospatial encoding scheme (
Section 4.3) immediately snaps any selected coordinate to the nearest discrete cell, so sub-cell coordinate imprecision is absorbed by design. GPS-induced drift is therefore not a concern in the
GeoVault threat model. Formally, the set of selected locations is defined in Equation (
12).
Here,
denotes the spatial domain of the application, and each point
is represented by its coordinates within that domain.
In the baseline instantiation, corresponds to the WGS-84 model of the Earth’s surface. However, the definition is deliberately abstract: may also represent a fictional continent, a virtual environment, or any other two-dimensional world equipped with a well-defined coordinate system. No randomness, salting, or grid quantization is imposed at this stage; each point is treated as an exact element of prior to encoding.
The cardinality n of is user-defined and represents a primary degree of freedom in the security–usability trade-off. Selecting more points increases the available spatial entropy but also increases the cognitive burden during recall. This trade-off is analyzed quantitatively in later sections through entropy modeling and attacker-adjusted work factor evaluation.
GeoVault exposes two primary security axes and one fine-tuning lever. The two primary axes are qualitatively different in kind: one grows the search space exponentially, the other collapses attacker throughput non-linearly via a hardware bottleneck. The fine-tuning lever scales costs linearly on both sides and belongs to a fundamentally different class.
(1) Number of spatial points n—exponential axis. The attacker’s search space grows exponentially in n (as ), while the legitimate user’s key derivation cost remains constant—a single Argon2id invocation regardless of how many points are selected. Each additional memorable location multiplies the attacker’s workload by under a uniform prior, at zero additional computational cost to the user. This is the dominant security axis: it is the only mechanism capable of reaching the Super Secure zone and the only one that provides a strictly improving defender–attacker ratio as the parameter increases.
(2) Memory parameter
m—non-linear hardware axis. Increasing the Argon2id memory cost raises the per-candidate evaluation cost for both parties, but the attacker’s throughput collapses non-linearly once
m exceeds available GPU VRAM, enforcing a hardware-bound bottleneck that compute parallelism cannot bypass. This axis is analyzed empirically in
Section 5. It cannot reach the Super Secure zone for
under any evaluated
m, but it determines the security zone for borderline configurations and sets the absolute work factor floor.
(3) KDF chaining depth
k—linear fine-tuning lever. A user may chain
k Argon2id invocations in series, where each call’s output becomes the password input of the next (Equation (
13)). Unlike axes (1) and (2), chaining scales both defender and attacker costs by exactly the same factor
k: the defender waits
longer, but the attacker’s throughput drops to
. Because the scaling is symmetric and linear,
k does not change the defender–attacker ratio
, cannot rescue a configuration that is already in the wrong security zone, and cannot substitute for
n or
m. Its role is narrow but practical: it allows a user whose
configuration lands just below a zone boundary to cross that threshold by accepting a proportionally longer wait, without memorizing additional locations or changing system parameters. The three parameters are orthogonal and can be combined freely, but should be sized in priority order: first fix
n, then
m, then optionally tune
k.
4.3. Geospatial Encoding
After the user selects a set of spatial points , GeoVault deterministically maps each point to a discrete spatial cell and produces a corresponding identifier. The purpose of geospatial encoding is to transform continuous spatial coordinates into a finite, reproducible representation that can be evaluated offline by both legitimate users and adversaries.
We formalize a geospatial encoding scheme as the composition defined in Equation (
14).
Here,
partitions the spatial domain into equal-area cells and returns the cell containing a point
, while
assigns each cell a deterministic identifier over a finite alphabet. The mapping
E is applied independently to each
, yielding a multiset of identifiers
The identifier strings themselves are not assumed to contribute cryptographic entropy; GeoVault relies exclusively on the unpredictability of the underlying spatial selections. Consequently, the encoding scheme must satisfy three core properties. First, injectivity: distinct spatial cells must map to distinct identifiers. Second, determinism: the same spatial point must always yield the same identifier. Third, offline resolvability: the encoding must be computable without access to a trusted online service. Beyond these requirements, the encoding need not be cryptographically random or secret and may be designed primarily for human usability.
4.3.1. Baseline Encoder (What3Words)
In the reference implementation, both
and
are instantiated using the What3Words grid and lexicon. The spatial domain is partitioned into 3 m × 3 m cells, each mapped to a unique three-word phrase. For a single unconstrained cell, substituting the Earth’s surface area
and cell area
yields a nominal entropy of approximately
bits, as derived previously in Equation (
5). This value reflects the size of the global cell index and is independent of the linguistic structure of the identifier.
Because a single encoded cell does not provide sufficient entropy for high-security applications, GeoVault supports aggregation of multiple encoded spatial points and introduces additional computational hardening in subsequent stages. Importantly, the cryptographic core of GeoVault remains encoder-agnostic: any geospatial encoding scheme that satisfies the above properties may replace the What3Words system without altering the security model or attacker assumptions.
4.3.2. Offline Resolvability and Open Alternatives
The word-to-coordinate resolution of What3Words depends on a proprietary algorithm and, in standard deployments, requires the What3Words API or a bundled SDK—a tension with the offline resolvability requirement above. For strictly offline deployments, Open Location Code (OLC/Plus Codes) [
33] is the recommended alternative: fully open, requiring no external service, and satisfying all three encoder properties. At precision level 11 (cell footprint
), OLC achieves a nominal single-point entropy comparable to the What3Words baseline. The What3Words instantiation is retained here as a usability reference; the security model and all quantitative results are independent of encoder choice.
4.4. Entropy Boosting
The entropy provided by spatial selection alone is insufficient to guarantee cryptographic-strength security under realistic offline attacker models, particularly in the presence of semantic bias and spatial clustering (
Section 3.2).
GeoVault therefore incorporates an explicit entropy-boosting stage based on a memory-hard key derivation function (KDF), which increases the computational cost of each candidate evaluation without introducing additional information entropy.
This mechanism does not increase Shannon entropy, but instead amplifies the computational hardness faced by an attacker by enforcing substantial time and memory costs per guess. As a result, brute-force resistance depends jointly on the effective spatial entropy and the hardware-bound cost of key derivation.
4.4.1. Key Derivation Function
GeoVault employs Argon2id, the hybrid variant of the Argon2 memory-hard KDF [
16]. Argon2id was selected for its resistance to GPU- and ASIC-accelerated attacks, combining data-independent memory access (Argon2i) with data-dependent access (Argon2d) to mitigate both side-channel and time–memory trade-off attacks.
Unlike legacy KDFs such as PBKDF2, Argon2id binds attacker performance to memory bandwidth and capacity rather than raw compute throughput, enabling strong defender–attacker asymmetry through tunable time, memory, and parallelism parameters.
Parameter Selection Rationale
The reference protocol uses
, derived from two constraints. Usability constraint: The defender latency must remain below a practical interactive threshold. The empirical benchmarks of
Section 5.2 confirm that a single Argon2id invocation at
MiB completes in
s on a commodity server CPU within the sub-two-second budget widely cited as acceptable for authentication operations [
45]. Security constraint: The memory cost must exhaust the GPU’s high-bandwidth memory per instance to eliminate parallelism gains. At
MiB the GPU can sustain at most
parallel instances, yielding an attacker throughput of
H/s (
Section 5.2). Setting
follows the Argon2 specification’s recommendation that a single pass with high
m provides stronger memory hardness guarantees than multiple passes with reduced
m [
16]. Setting
is appropriate for a single-thread defender on commodity hardware; increasing
p beyond the available CPU cores would not reduce defender latency and would only benefit attackers with wide SIMD parallelism. The parameter range
is evaluated empirically in
Section 5.2 to characterize the full defender–attacker trade-off space.
4.4.2. Input Construction
Let
denote the set of deterministic identifiers produced by geospatial encoding (
Section 4.3). These identifiers are concatenated in a fixed canonical order:
and hashed to obtain a fixed-length KDF input
where
H denotes SHA-256 in the reference implementation.
Canonical Ordering and Usability
The concatenation is order-sensitive: swapping two identifiers produces a different S and therefore a different key. This preserves the permutation space as part of the secret, at the cost of requiring the user to recall both the locations and their entry order. Entering the same locations out of order silently yields the wrong key. Lexicographic sorting would remove this requirement at a cost of bits of entropy (e.g., ≈1.6 bits for ); the ordered construction is the reference throughout this paper.
4.4.3. Argon2id Hardening
The spatial hash
X is processed using Argon2id as defined in Equation (
18):
where
t is the iteration count,
m the memory cost, and
p the degree of parallelism. The salt is fixed and public and does not contribute entropy.
All parameters
are assumed to be known to the attacker, consistent with Kerckhoffs’ principle. Security arises from the enforced memory footprint of Equation (
18), which bounds attacker parallelism by available high-bandwidth memory. The resulting defender–attacker asymmetry and its impact on offline brute-force resistance are quantified empirically in
Section 5.
Fixed Salt and Rainbow Table Resistance
A fixed, public salt follows established practice: BIP-39 does the same, deriving keys via . Rainbow tables are infeasible for two reasons. First, each evaluation at occupies of high-bandwidth memory, making a table over even entries physically unrealizable. Second, the KDF input space reaches bits for n points, far beyond any feasible precomputation budget. The fixed salt introduces no exploitable weakness beyond what the entropy model and KDF cost parameters already account for.
Protocol-Fixed Parameters and User Cognitive Load
are protocol-level constants—analogous to BIP-39’s fixed 2048-iteration count—and are never recalled by the user. The only user-facing parameter is the chaining depth
k, which defaults to
and is expressed as a single integer. The complete derivation state the user must retain is therefore (1) their spatial point(s), and (2)
k if non-default.
Section 5 surveys the full
spectrum to characterize attacker throughput; the reference parameters are chosen from this spectrum to target a ≈1 s defender latency on commodity hardware.
4.5. Security Evaluation Methodology
The goal of our evaluation is to quantify how much practical protection a user gains from a given key derivation cost when the defender derives keys on a CPU while the attacker mounts an offline brute-force attack using GPU-class hardware. In particular, we seek to answer questions of the form: “if a user is willing to spend seconds on a memory-hard key derivation function, what level of brute-force resistance does this provide against a realistic GPU-equipped attacker?”
Our methodology decomposes the problem into three components: a defender (user) cost model, an attacker cost model, and a derived security metric that combines both into a single, comparable quantity.
4.5.1. Defender (User) Cost Model
On the defender side, we assume that the user derives keys using Argon2 on a commodity CPU. For a given choice of Argon2 parameters
, where
t denotes the number of iterations,
m the memory cost, and
p the degree of parallelism, and a chaining depth
(
Section 4.1), the total wall-clock time required for key derivation is defined in Equation (
19).
where
denotes the runtime of a single Argon2id invocation. The factor
k scales the defender latency linearly; the empirical evaluation in
Section 5 benchmarks the
case and all
results follow by multiplication. This quantity directly captures the usability cost: larger values increase resistance to brute-force attacks but also increase the delay experienced by legitimate users. In our evaluation, we restrict attention to parameter sets and chaining depths for which
remains within an acceptable latency budget for interactive use.
4.5.2. Attacker (GPU) Cost Model
On the attacker side, we assume access to parallel computing resources typical of modern GPUs and distinguish between two fundamentally different classes of operations.
Fast hash evaluations (e.g., SHA-256) are compute-bound and highly parallelizable, and are relevant for attacking mnemonic schemes without additional computational hardening, such as raw BIP-39 verification.
Memory-hard KDF evaluations (Argon2id) are constrained by memory bandwidth and capacity rather than raw compute throughput, and are relevant for attacking spatial mnemonics protected by a memory-hard key derivation function. In this case, the attacker is assumed to evaluate the same Argon2 parameter set as the defender.
For each class of operations, we characterize the attacker by an effective guess rate, defined in Equations (
20) and (
21).
A key distinction between the two cases lies in their scalability under parallelism. Fast hash functions scale primarily with available compute resources and can exploit massive parallelism efficiently. In contrast, parallelism for memory-hard KDFs is intrinsically bounded by available high-bandwidth memory. For an attacker with total memory capacity
M, the maximum number of fully independent parallel KDF instances is upper-bounded by
where
m denotes the per-instance memory cost. Once this bound is reached, additional compute units cannot be exploited to increase throughput, and attacker performance becomes memory-bound rather than compute-bound.
4.5.3. Derived Security Metric: Attacker-Adjusted Work Factor
To compare the brute-force resistance of mnemonic- and spatial-based schemes under realistic adversarial capabilities, we employ a unified security metric termed the attacker-adjusted work factor, denoted . This metric captures the expected time required for an offline adversary to exhaust the effective search space of candidate secrets under the assumed attacker model.
Let
H denote the effective entropy of the scheme, so that the total number of candidates is
. Let
C represent the computational cost required to evaluate a single guess, expressed in hash-equivalent operations, and let
R denote the attacker’s sustained guess rate in evaluations per second. The expected attacker work factor is given by Equation (
22).
Empirical Instantiation
In practice, the attacker’s guess rate
R is instantiated using measured GPU throughput, while
C is determined by the cost of the key derivation function (KDF) or hash evaluation used by the scheme. For constructions employing a memory-hard KDF such as Argon2 with parameters
and chaining depth
k, the attacker must perform
k sequential Argon2id invocations per candidate (
Section 4.1, Equation (
13)), yielding the attacker-adjusted work factor in Equation (
23).
Since the serial data dependency in Equation (
13) prevents the attacker from parallelizing across the
k rounds for a single candidate, the effective GPU throughput for a chained evaluation is
, and the work factor scales linearly with
k. All results reported in
Section 5 correspond to
; the work factor for any
is obtained by multiplying the reported values by
k.
For baseline schemes without computational hardening—such as BIP-39 mnemonics or single-point spatial secrets verified using a single cryptographic hash—the work factor reduces to Equation (
24).
Interpretation via Security Zones
To aid interpretation of attacker-adjusted work factors, we categorize security outcomes into three qualitative zones based on the expected offline attack time. These zones are not intended as strict security guarantees, but as interpretive reference points commonly used in password security and cryptographic practice [
46].
Insecure ( s). Configurations for which corresponds to attack times on the order of hours, days, or weeks on commodity GPU hardware.
Human-Scale Secure ( s s). Configurations for which corresponds to attack times on the order of years to centuries under the assumed attacker model, reflecting security levels commonly considered sufficient for systems relying on human-memorable secrets.
Super Secure ( s). Configurations for which approaches or exceeds the effective brute-force resistance of uniformly generated 128-bit secrets, such as standard BIP-39 mnemonics, and are widely regarded as computationally infeasible even for highly resourced adversaries.
The two threshold values are derived as follows. The lower boundary of
s (≈317 years) represents a conservative upper bound on the sustained effort a highly resourced adversary—including a nation-state actor—could plausibly dedicate to a single targeted brute-force campaign [
46]. A secret with
s is considered practically breakable, since even a prolonged but finite multi-year investment by such an adversary would exhaust the search space within a realistic operational horizon. The upper boundary of
s is anchored to the standard 128-bit security level. Using the empirically measured SHA-256 attacker throughput
H/s (reported in
Section 5.2), exhausting a
-candidate search space requires
This matches the brute-force resistance implied by the BIP-39 standard’s 128-bit entropy target and is consistent with widely adopted guidelines for symmetric cryptographic security [
46]. Secrets above this threshold are considered cryptographically equivalent to uniformly random 128-bit keys under the evaluated attacker model.
The Results Section uses this attacker-adjusted work factor and the associated security zones to interpret empirical measurements across different spatial entropy levels and key derivation parameter choices.
4.6. Construction of Spatial Dictionaries
All spatial dictionaries use a fixed cell resolution of , corresponding to a cell area of .
For areal regions, including the global surface, terrestrial land, habitable land, and urban boundaries, the effective area
is set directly to the reported surface area of the corresponding region, as obtained from established geographic datasets [
47,
48]. These dictionaries represent attacker strategies that restrict guesses to broad but semantically meaningful regions.
For linear features such as coastlines and rivers, the effective area is approximated by buffering the feature along its total length. For a feature of total length
L (in km), the effective area is given by Equation (
26).
where
denotes the buffer width expressed in kilometers. For rivers, buffers are applied symmetrically to both banks, yielding the effective area defined in Equation (
27).
For point-like semantic anchors such as UNESCO World Heritage sites or major mountain peaks, each anchor is modeled as a disk of radius
w. For a dictionary containing
such anchors, the effective area is approximated by Equation (
28).
Overlaps between buffered regions are ignored in all cases, yielding conservative, attacker-favoring estimates of effective area and corresponding entropy.
4.7. Experimental Setup
All measurements in this paper, including BIP-39 and Argon2id, were obtained on a single controlled workstation and software environment to ensure methodological consistency. The purpose of this setup is to empirically characterize both defender-side computation costs and attacker-side throughput under realistic CPU and GPU execution models. No parameter tuning beyond vendor- or implementation-recommended defaults was applied unless explicitly stated.
Table 2 summarizes the hardware platform, operating system, GPU driver stack, and cryptographic benchmarking tools used throughout the study. Hashcat was used for both PBKDF2 (mode 12100) and Argon2id (mode 8200) benchmarks to ensure comparability across key derivation functions.
Attacker-side benchmarks used configurations maximizing effective guess throughput under the memory-bound parallelism constraint of
Section 4.5.
Benchmark configurations were selected to reflect canonical settings of the evaluated schemes while ensuring reproducibility and a clear separation between defender and attacker constraints. For PBKDF2-HMAC-SHA512, the BIP-39 specification was followed using 2048 iterations and the fixed salt string “mnemonic”. Benchmark tools that internally employ reduced iteration counts were used only to measure raw throughput, and reported attacker rates were scaled linearly to correspond to the full BIP-39 iteration count.
For Argon2-based spatial mnemonics, the Argon2id variant was evaluated using parameter triples , where t denotes the number of passes, m the memory cost per instance, and p the degree of parallelism. Defender-side measurements were obtained using fixed parameter sets to characterize user-perceived latency under interactive constraints. Attacker-side measurements, in contrast, were obtained under maximal-throughput configurations in which available GPU parallelism and memory allocation were selected to saturate device resources and maximize the measured guess rate . All benchmarks were executed under sustained load and repeated across multiple runs to mitigate transient system effects, yielding optimistic attacker throughput estimates that are used in subsequent sections to compute attacker-adjusted work factors.
5. Results
In this section, we instantiate the analytical framework of
Section 4.5 for concrete mnemonic schemes. We quantify resistance against offline brute-force attacks by separating (i) the effective search space size, expressed via input entropy
H (so
), from (ii) empirically measured attacker throughput
R on representative GPU hardware.
5.1. Effective Spatial Entropy Under Attacker Priors
We first establish the effective search space (
N) for spatial inputs under attacker-prioritized priors. Using the spatial dictionary construction defined in
Section 4.3, we evaluate the effective entropy
for increasingly pessimistic spatial dictionaries.
Table 3 summarizes the results. The “Omnibus HPZ” dictionary represents a pessimistic upper bound on attacker knowledge, assuming the user selects a semantically salient region within a broad human-preferred zone (HPZ), operationalized as the union of habitable land, coastlines, rivers, and dense urban regions.
Crucially, even under the pessimistic Omnibus prior, the effective entropy remains ≈ 41.4 bits, substantially higher than typical human-chosen passwords, which empirical studies frequently place below ≈20 effective bits under adaptive guessing models [
2]. The 41.38-bit Omnibus value refers to spatial entropy alone and does not determine security zone membership. Zone classification in
GeoVault is based on the attacker-adjusted work factor
, which incorporates the Argon2id cost. As shown in
Section 5.6, an
secret with Omnibus spatial entropy reaches the Human-Scale Secure zone at
MiB, and at
MiB,
s—more than six orders of magnitude above the Insecure threshold.
Across all spatial dictionaries evaluated in
Table 3, the effective per-point entropy spans the range
bits, bounded below by the Urban (city-scale) prior and above by the Omnibus HPZ prior. For an
n-point secret under independent-point selection, the total effective entropy therefore satisfies
where the lower bound corresponds to a demographically informed attacker with city-level knowledge of the user’s location, and the upper bound corresponds to a geographically uninformed attacker restricted to the broadest realistic human-preferred zone. No attacker prior evaluated in this work contracts the per-point entropy below 27.4 bits without sub-neighborhood-scale knowledge of the user’s location, confirming that this bracket spans the full range of evaluated threat models.
Radius Constraints on Multi-Point Selection
For scenarios involving multiple spatial points, the total spatial entropy is constrained by geographic clustering.
Figure 1 visualizes the total effective entropy
as a function of the clustering radius
r under the proximity model defined in
Section 3.2.3. For small radii, additional points contribute limited independent entropy due to spatial correlation, whereas increasing dispersion increases the effective search space.
The figure further illustrates that for , the achievable entropy saturates below the 128-bit target even under maximal geographic dispersion, as the second point contributes at most the global single-cell entropy.
Table 4 reports the raw entropy limits and the dispersion radius required to reach the 128-bit spatial entropy target defined in
Section 4.5 for increasing numbers of selected points under the proximity model of
Section 3.2.3.
The entropy values in
Table 4 are bounded by a maximum feasible dispersion radius corresponding to the Earth’s circumference (approximately
km). For configurations in which the required radius exceeds this bound, the 128-bit entropy target defined in
Section 4.5 cannot be achieved through geographic dispersion alone. This Earth-scale constraint defines a hard upper limit on the entropy contribution of additional spatial points under the proximity model of
Section 3.2.3 and explains the saturation behavior observed for small
n in
Figure 1.
5.2. Key Derivation Cost: Defender Latency vs. Attacker Throughput
To instantiate attacker-adjusted work factors (
Section 4.5), we measured defender-side latency on CPU and attacker-side throughput on GPU. These measurements instantiate the attacker rates
and
in Equations (
24) and (
23).
5.2.1. Defender-Side Key Derivation Latency
A legitimate user (defender) performs a single key derivation event during vault decryption. To quantify the user-visible cost, we measured key derivation latency on the CPU platform specified in
Section 4.7.
For the unhardened baseline (SHA-256), computational overhead was negligible (<0.01 ms). The BIP-39 baseline, which uses PBKDF2-HMAC-SHA512 with 2048 iterations, exhibited a median latency of 3.57 ms, a delay that is effectively imperceptible to users. The full latency distribution is reported in
Table 5.
In contrast,
GeoVault applies Argon2id to deliberately impose a hardware-bound delay on key derivation.
Table 6 reports defender-side latency as a function of the Argon2id memory parameter
m (with
and
). Configurations up to 1024 MiB remain within a few seconds and are suitable for interactive use, whereas extreme configurations at 16–32 GiB incur minute-scale delays consistent with archival or cold-storage usage.
5.2.2. Attacker-Side Guess Throughput
Attacker-side capabilities were benchmarked using Hashcat under the attacker model defined in
Section 4.5. We distinguish between compute-bound primitives, which scale efficiently with GPU parallelism (SHA-256, PBKDF2), and memory-bound primitives, for which throughput is fundamentally constrained by available high-bandwidth memory (Argon2id).
BIP-39 Throughput (PBKDF2)
For linguistic mnemonics, an offline attacker evaluates candidate phrases using PBKDF2-HMAC-SHA512 as specified by the BIP-39 standard. Hashcat benchmarks were scaled to the full 2048-iteration configuration. The measured sustained throughput is reported in
Table 9:
Fast Hash Throughput (SHA-256)
For unhardened secrets verified using a single cryptographic hash, the attacker employs raw SHA-256. Benchmarks indicate a sustained throughput of approximately
with only minor degradation as input length increases. This rate serves as the baseline attacker capability for raw spatial and password-based secrets without computational hardening.
Argon2id Throughput (Memory-Hard Regime)
To characterize attacker performance against
GeoVault-style hardened secrets, we measured Argon2id throughput across a wide range of memory costs
m with fixed parameters
.
Table 10 reports the resulting attacker throughput as a function of
m.
The results show a sharp, non-linear collapse in attacker throughput as the memory cost approaches and exceeds the GPU’s available VRAM. At MiB, throughput is already reduced by more than five orders of magnitude relative to SHA-256. At 16–32 GiB, attacker throughput falls below one guess per second, indicating entry into a hardware-capacity regime in which GPU parallelism is no longer exploitable and brute-force attacks become effectively serialized.
Defender–Attacker Throughput Asymmetry
To separate the two primary security axes of
GeoVault—memory cost
m and spatial dimensionality
n—and isolate the KDF’s standalone contribution, we define the GPU throughput advantage:
where
is the attacker GPU guess rate (H/s) and
is the defender CPU key derivation latency (s). The quantity
is the number of attacker guesses that can be tested in the time it takes the legitimate user to perform one key derivation. Its value as a design metric lies in decomposing the security problem:
characterizes the residual attacker advantage after
m is fixed, independently of the search space size set by
n.
Table 11 reports
for the measured Argon2id tiers.
Two structural properties follow directly from this decomposition. First, KDF chaining depth
k leaves
unchanged. Since
and
, the product
. Chaining raises the absolute cost of each guess symmetrically but cannot change the ratio; it belongs to the
m-axis, not to a third independent axis. Second, spatial dimensionality
n is orthogonal to
. Adding a spatial point multiplies the search space by
per point (
Table 4) without affecting either
or
. Consequently,
can be read directly as the fraction of the security budget that must be covered by the search space rather than by the KDF: at
, the search space must be at least 66 imes larger than the KDF work factor to clear any given security threshold, a condition met with wide margin at
.
Argon2id Throughput Under KDF Chaining
Table 12 reports derived attacker throughput under KDF chaining at the
MiB reference tier, computed as
with
H/s (
Section 5.2.1; CPU linearity validates the formula,
Table 8; all time-to-compromise values are conservative lower bounds).
ASIC Attack Resistance
A natural concern is whether purpose-built application-specific integrated circuits (ASICs) could exceed the measured GPU attacker throughput and invalidate the work factor estimates. For Argon2id, this is fundamentally limited by the memory bandwidth bottleneck rather than compute throughput. The Argon2 specification formally establishes that any implementation allocating less than the nominal memory
m incurs an exponential time penalty through mandatory data-dependent recomputation, precluding any space–time trade-off [
16]. A hypothetical Argon2id ASIC must therefore provision high-bandwidth memory comparable to the target GPU; savings come only from eliminating general-purpose execution overhead in the compute fabric, which the Argon2 designers estimate yields a practical speedup of approximately 2–
over GPU-class hardware for realistic ASIC implementations [
16,
17]. Applying a conservative
ASIC advantage to the reference attacker rate (
H/s at
MiB) yields
H/s. This reduces all reported Argon2id attacker-adjusted work factors by at most
bits. At
spatial points, the baseline work factor exceeds
s (Super Secure zone,
Section 5.6); a 3.3-bit reduction leaves it at >
s, well within the same zone. For the
case at
MiB, the work factor remains in the Human-Scale Secure zone even under this pessimistic ASIC bound. The GPU benchmark therefore provides a conservative upper bound on adversarial throughput under memory-hard protection, and ASIC-specific threats do not materially alter the security conclusions of
Section 5.4,
Section 5.5 and
Section 5.6.
5.3. Baseline Brute-Force Resistance Under Fast Hashing
We analyze the security of linguistic and spatial mnemonics in their raw state, assuming an attacker utilizes a standard fast hash primitive (SHA-256) without memory-hard protection. In this setting, entropy is treated as a fixed input parameter, and security is evaluated exclusively via the attacker-adjusted work factor
defined in Equation (
22). Following
Section 4.5, we categorize security into three qualitative zones based on
: Insecure (<
s), Human-Scale Secure (
–
s), and Super Secure (≥
s).
5.3.1. Brute-Force Resistance of Linguistic Secrets
For typical human passwords, brute-force resistance under fast hashing is determined by the effective entropy implied by alphabet size and length. As established in
Section 2.1, human-chosen passwords frequently collapse to approximately 20 bits of effective entropy under adaptive guessing models. Using the baseline attacker work factor formulation for fast hashing given in Equation (
24), the expected time to compromise such a secret is
To transition from the Insecure zone into the Super Secure (BIP-39) regime without computational hardening, a user would need to memorize a truly random ASCII-94 string. The required string length
L follows directly from the entropy target:
Such strings (e.g.,
7&y#B9@q!x2$LpZ*5mW1) substantially exceed typical human cognitive limits for reliable recall, illustrating the well-known usability barrier of high-entropy linguistic secrets.
5.3.2. Brute-Force Resistance of Spatial Secrets
In contrast, spatial mnemonics exhibit substantially higher baseline brute-force resistance due to their higher effective entropy. Applying the same baseline work factor model of Equation (
24) to a single unhardened What3Words location (
) yields an expected attack time of approximately
s (about 50 days). While this remains within the Insecure zone, it represents roughly a seven-order-of-magnitude improvement over typical human-chosen passwords.
Figure 2 visualizes this equivalence in brute-force resistance between spatial and linguistic secrets across the three defined security zones under fast hashing.
As the number of spatial points increases, the total search space expands sufficiently for spatial mnemonics to exceed the 128-bit security threshold even without computational hardening.
Table 13 reports the geographic dispersion required for the baseline work factor of Equation (
24) to enter the Super Secure regime under SHA-256.
These results highlight the fundamental advantage of spatial mnemonics under fast hashing: achieving super-secure brute-force resistance requires either memorizing approximately 20 random characters (cognitively impractical) or recalling a small number of geographically meaningful locations (cognitively feasible).
5.4. Brute-Force Resistance Under Memory-Hard KDFs
We analyze the impact of memory-hard key derivation on brute-force resistance by applying Argon2id across a wide range of memory costs, including extreme configurations (16 GiB and 32 GiB) that approach or exceed the physical VRAM limits of professional-grade GPU hardware. As in the previous subsection, entropy is treated as a fixed input parameter, and security is evaluated via the attacker-adjusted work factor
defined in Equation (
22). Throughout this analysis, BIP-39–equivalent security is defined as a time-to-compromise exceeding
s, consistent with
Section 4.5.
Table 14 reports the corresponding time-to-compromise values for a representative linguistic secret (approximately 20 bits of effective entropy) and a single spatial point (
, approximately
bits), derived from the attacker throughput measurements reported in
Section 5.2. All values are computed using the Argon2id work factor model of Equation (
23).
The results demonstrate a pronounced hardware-enforced non-linearity in attacker cost as the Argon2id memory parameter approaches the GPU’s available VRAM. In particular, at 16–32 GiB, attacker throughput collapses due to the inability to parallelize memory-bound computations, effectively serializing brute-force attempts.
For linguistic secrets, even extreme 32 GiB hardening—while imposing substantial defender-side latency—remains vulnerable to compromise on the order of one year under the attacker model considered. In contrast, a single spatial point () under the same hardening parameters achieves an expected time-to-compromise of approximately years. Although this remains below the second BIP-39 threshold, it comfortably exceeds human and geological timescales, corresponding to the Human-Scale+ regime.
These results confirm that combining spatial entropy with high-memory KDF hardening fundamentally alters the brute-force landscape by enforcing hardware-level constraints on the attacker, thereby breaking the traditional security–usability trade-off observed for linguistic secrets.
5.5. Effect of KDF Chaining Depth on Brute-Force Resistance
KDF chaining (Equation (
13)) provides a user-controlled work factor multiplier that scales both sides of the security equation linearly. Because each invocation reuses the full
m-byte memory footprint and the serial data dependency prevents per-candidate parallelism, attacker throughput drops to
(
Table 12; conservative upper bound-CPU linearity ratios
confirm that the true attacker rate satisfies
) while defender latency grows to
(
Table 8).
Table 15 quantifies this trade-off at two memory tiers (
MiB and
MiB); all attacker time-to-compromise values are lower bounds on actual security.
Three conclusions follow. First, for low-entropy linguistic secrets (≈20 bits), chaining provides no practical rescue at either memory tier: even at
, the expected time-to-compromise remains below
s (Insecure zone) because the entropy deficit is too large for a linear multiplier to overcome. Second, for single-point spatial secrets, increasing
m from 1 GiB to 4 GiB multiplies the work factor by
at every
k level-a direct consequence of the
reduction in attacker throughput—yet all entries remain in the Human-Scale Secure zone. No combination of
k and
m evaluated here lifts an
configuration to the Super Secure zone; that threshold requires additional spatial points (
Section 5.6). Third, the 4 GiB column makes the defender cost explicit: the 10 s per-invocation wait is acceptable for cold-storage or archive scenarios, but is above the interactive 2 s budget at
. Users with a strict latency constraint should prefer
MiB with higher
k, or add a second spatial point, over increasing
m to 4 GiB.
5.6. Brute-Force Resistance via Combined Spatial and Computational Hardening
We analyze the joint effect of KDF chaining depth
k, multi-point spatial selection, and memory-hard key derivation on brute-force resistance. Since each factor independently multiplies the attacker work factor, their combination yields compounding security gains without additional memorization burden. As in the preceding subsections, spatial entropy is treated as a fixed input parameter (
Section 3.2.3), and security is evaluated exclusively via the attacker-adjusted work factor
defined in Equation (
22).
Figure 3 visualizes the combined effect: the Y-axis encodes the number of spatial points
n (top panel) or password length in characters (bottom panel), the X-axis is the attacker work factor on a log scale, and each colored line corresponds to a different Argon2id memory tier. The figure instantiates Equation (
23) using the empirically measured throughput values of
Section 5.2.
Table 16 reports defender-side latency and attacker time-to-compromise for spatial secrets with increasing numbers of points
n, evaluated across Argon2id memory settings ranging from 64 MiB to 8 GiB. Spatial entropy values for each point count are taken from
Table 4. Defender latency corresponds to CPU-side Argon2id execution, while attacker time-to-compromise reflects single-GPU brute-force attempts under the assumed attacker model.
The reported values show that increasing the number of spatial points rapidly dominates brute-force resistance, such that modest Argon2id memory settings already yield extremely large attacker work factors for . At the same time, defender-side latency increases only linearly with the Argon2id memory parameter, remaining below one second for memory settings up to 256 MiB and below five seconds at 2 GiB.
Relative to the BIP-39 equivalence criterion defined in
Section 4.5 (time-to-compromise exceeding
s), the table demonstrates that combined spatial and computational hardening achieves cryptographic-strength brute-force resistance without requiring extreme KDF parameters. This illustrates how spatial expansion of the search space reduces reliance on aggressive computational hardening, enabling strong security guarantees while preserving practical usability.
Targeted Attacker Scenario
The values in
Table 16 assume the global nominal spatial entropy (
bits). For a demographically informed attacker who knows the user’s city of residence—a scenario supported by findings in [
14,
15,
29]—the anchor entropy collapses to ≈
bits (Urban dictionary,
Table 3), a reduction of ≈
bits. For
, this brings the unaided spatial entropy below any practically secure threshold. However, the Argon2id hardening layer compensates: at
MiB, the attacker time-to-compromise for a single-point secret under the targeted prior is ≈
s (≈2 days), which falls in the Insecure zone. Increasing to
points restores the combined entropy to ≈108 bits even under the urban anchor, placing the work factor firmly in the Human-Scale Secure zone at all memory settings ≥64 MiB. This confirms that multi-point selection is essential for security against targeted adversaries.
6. Discussion
GeoVault rests on a concrete premise: instead of forcing users to memorize abstract symbol sequences, derive keys from places they already know. The results confirm that this shift is quantitatively meaningful. Spatially anchored secrets carry a substantially higher effective entropy floor than human-chosen passwords under the same attacker priors, and Argon2id hardening amplifies this margin non-linearly as memory costs approach GPU VRAM capacity.
6.1. Effective Entropy of Spatial Mnemonics
Spatial mnemonics provide a measurably higher baseline of effective entropy than linguistic secrets selected by humans. As shown in
Section 5.1, even pessimistic attacker models that restrict guesses to semantically meaningful spatial dictionaries, such as habitable land, coastlines, or urban regions, retain effective entropy levels on the order of 40–42 bits for a single spatial cell. This stands in contrast to human-chosen passwords and passphrases, which empirical studies consistently place well below 20 effective bits under realistic offline guessing strategies.
However, spatial entropy is not immune to demographic reduction. Research on geographic password systems [
14,
15,
29] documents that a targeted attacker who knows the user’s city of residence can restrict the search to an urban-scale dictionary, collapsing the anchor-point entropy from
bits to
bits—a reduction of
bits. This finding applies to the anchor point (the first selected location) and propagates into every subsequent point’s contribution: the total
n-point entropy shifts downward by the same fixed offset regardless of
n. Critically, this does not eliminate the utility of multi-point selection. As demonstrated in the targeted attacker analysis of
Section 5.6,
points under the urban prior still reach approximately 108 bits of combined spatial entropy, placing the work factor firmly in the Human-Scale Secure zone at all evaluated Argon2id settings. The 27.4-bit figure therefore reinforces rather than undermines the paper’s core recommendation: a minimum of
spatial points is required to maintain adequate security margins against a demographically informed adversary, regardless of the computational hardening applied.
This higher entropy floor directly shifts what is practically achievable with human-memorable secrets. Verbal recall collapses under selection bias; spatial recall preserves a larger and more evenly distributed search space. A small number of spatial selections—even under conservative attacker prior assumptions—provides substantially greater offline attack resistance than password-based constructions, with no requirement to memorize abstract symbol strings.
6.2. Impact of Memory-Hard Key Derivation
Spatial entropy alone does not reach the 128-bit security baseline for single-point configurations under a demographically informed attacker; Argon2id hardening is required to close this gap. The benchmarks in
Section 5.2 reveal a pronounced defender–attacker asymmetry as memory costs increase. Attacker throughput decreases non-linearly once per-instance memory requirements approach available GPU memory, while remaining feasible, though intentionally costly, for legitimate users operating on commodity CPUs.
This asymmetry arises because Argon2id binds performance to memory bandwidth and capacity rather than raw compute throughput. Under the evaluated attacker model, increasing parallelism does not efficiently compensate for memory exhaustion, resulting in sharp reductions in effective guess rates. Combined, spatial entropy and Argon2id hardening allow GeoVault to reach attacker-adjusted work factors at or above established cryptographic baselines with user-side latencies under 3 s on commodity hardware.
Hardware generalizability is quantified in
Section 5.2 (
Table 7): consumer laptops incur a
–
latency penalty, placing
MiB at 2–3 s. On the attacker side, a four-GPU RTX 4090 cluster raises
to ≈264 at
MiB—a
bit reduction in work factor that does not alter the security zone classification for any
configuration.
6.3. Security Gains from Multi-Point Spatial Selection
The combination of multiple spatial selections and computational hardening produces rapid growth in attacker-adjusted work factors. As shown in
Section 5.6, a small number of spatial points within a realistic dispersion radius, combined with modest Argon2id memory settings, yields work factors that approach or exceed the 128-bit security baseline. These gains come at sub-second to second-scale user-side latency, making the approach practical for interactive use.
Users can trade spatial complexity against computational hardness according to their threat model: more points reduce reliance on extreme KDF parameters, while higher m compensates for fewer points in low-n configurations.
A useful way to see this flexibility is to ask the following: For each value of n, what KDF chaining depth k is needed to reach the Super Secure zone ( s), and is the corresponding user-side wait time ( s) practical? Under the global attacker prior (45.7 bits per point), the answer for is , and for , it is (a wait of roughly 7.5 days)—both completely infeasible. At , the search space already exceeds the Super Secure threshold at with a 1.3 s wait, so no chaining is needed. Under the worst-case urban prior (27.4 bits per point, full demographic leakage of all selected locations), the work factor drops to s, which is solidly Human-Scale Secure but below Super Secure; bringing it above the s threshold via chaining alone would require , again infeasible. However, adding a fourth location keeps the defender burden identical (a single Argon2id invocation over four cell identifiers) and raises the base work factor to s. At this point, a chaining depth of only (user-side wait: s) crosses the Super Secure threshold: s. The practical implication is that users who face the worst-case demographic prior—where a targeted attacker can infer even the secondary and tertiary locations from public data—can compensate with one extra spatial point and a single-digit chaining depth, incurring less than 8 s of additional wait, without changing any other system parameter.
6.4. Threat Model and Limitations
The security properties of GeoVault are evaluated under a strong offline attacker model in which the adversary has unrestricted access to high-end GPU hardware but no auxiliary side-channel information beyond attacker-prioritized spatial dictionaries. The system does not rely on secrecy of the geospatial encoding scheme or obscurity of the spatial domain. Security derives solely from the effective entropy of user-selected locations and the enforced computational cost of key derivation.
The analysis does not account for targeted social engineering, coercion, or leakage of user-specific spatial preferences. As with all mnemonic-based systems, knowledge of a user’s habits, routines, or personal history—inferred from social networks, tagged photographs, or application location data—could reduce the effective search space if spatial choices are overly predictable. To quantify this risk, consider a worst-case public-information scenario where the attacker recovers the user’s home city from a social-media profile, reducing the anchor’s effective entropy from the global value of
bits to the urban-scale floor of
bits (
Section 3.2.3). If the attacker further infers—from tagged photographs and check-in history—that all
n selected locations lie within the same city, every spatial point collapses to the same urban prior, giving a total entropy of
bits. For
, this yields
bits, which—combined with the Argon2id work factor at
MiB—maintains an attacker-adjusted work factor above the Human-Scale Secure threshold. Complete demographic leakage across all
n points therefore represents the tight lower bound on effective entropy; in practice, secondary and tertiary points are rarely as publicly documented as the anchor, so the true exposure lies between this floor and the uninformed model. The core recommendation of
points with
GiB therefore remains sufficient even under full urban leakage of the anchor. Additionally, this work assumes accurate user recall within the resolution of the spatial encoding scheme. The current protocol provides no error tolerance for recall precision: a single-cell recall error (selecting a geographically adjacent 3 m × 3 m tile rather than the originally selected cell) produces a completely different KDF input string, causing 100% key derivation failure regardless of
n or
m. This is a fundamental usability limitation. Future work should explore error-tolerant extensions—such as a nearest-cell retry window or a spatially defined equivalence class—provided any such relaxation does not increase the attacker’s effective dictionary size beyond the security margins established in
Section 5.6.
A significant limitation of the present work is the absence of an empirical user study. All claims regarding the usability of spatial memory as a cryptographic input—including long-term retention of location-based secrets, recall accuracy under realistic conditions, reduced cognitive load relative to alphanumeric passphrases, and the practical tolerability of the Argon2id computation latency—are inferred from the cognitive psychology and geographic password literature rather than validated through direct experimentation with human participants. While the cited studies [
8,
9,
15,
26,
27,
28] provide strong indirect evidence that spatial memory is more robust and durable than linguistic recall, they were not designed to evaluate the specific
GeoVault workflow, and their findings may not transfer directly to the key derivation context. The central claim of the paper—that spatially anchored secrets offer a practical security and usability advantage over linguistic passwords—cannot be fully verified without a controlled user study. Future work should conduct such a study, measuring at minimum (1) long-term recall accuracy of encoded locations over periods of four or more weeks; (2) user error rates and the resulting key derivation failure rate; (3) user-perceived cognitive load compared to BIP-39 mnemonic phrases; and (4) empirical selection distributions within each spatial dictionary stratum, to calibrate the within-stratum fraction
(Equation (
9)) and validate the area-based entropy upper bounds that underlie
Section 5’s security zone classifications. A dedicated study should further examine accessibility for populations with reduced spatial cognitive ability, including elderly users and individuals with navigation-related impairments, for whom spatial recall may be a less reliable secret substrate.
Quantum Adversary Model
The threat model evaluated in this paper assumes a classical GPU-based attacker. Under a quantum adversary, Grover’s algorithm [
51] provides an asymptotic
speedup on unstructured search, which in bit terms, halves the effective entropy of any search space-based secret. Applied to
GeoVault, this reduces the per-point spatial entropy from
bits to ≈
bits under a quantum attacker with no demographic knowledge, and from
bits to ≈
bits under the worst-case urban prior. For
points under the uninformed quantum attacker, the total effective entropy is
bits; under the urban prior, it is
bits. Both values remain above the Human-Scale Secure threshold before any Argon2id hardening is applied. Shor’s algorithm [
52] is irrelevant to this construction: it attacks integer factoring and discrete logarithm problems that underlie asymmetric cryptography, not the symmetric hashing and memory-hard computation performed by Argon2id. Argon2id itself has no known quantum speedup beyond Grover; its memory-hardness argument is independent of the computational model. The practical implication is that
GeoVault’s
recommendation provides a meaningful security margin against a quantum attacker under the uninformed prior, but is insufficient under full urban leakage: an
configuration (
bits) restores a comfortable margin. A full post-quantum security analysis of memory-hard functions under more refined quantum random-access memory (QRAM) models is an open research problem and is beyond the scope of this work.
Physical side-channel attacks—such as timing analysis, power consumption profiling, or memory-access pattern leakage during Argon2id evaluation—are implementation-dependent and fall outside the scope of the protocol-level security model evaluated here. Their relevance and severity depend on the execution environment (browser, mobile app, hardware token) and are left for implementation-level security analysis in future work.
6.5. Implications for Human-Centered Key Management
Spatial memory is a viable and largely untapped resource for cryptographic key management. Unlike traditional brainwallets, where abstract linguistic recall collapses under selection bias, spatially anchored secrets maintain a substantially higher effective entropy floor under realistic offline attacker models. GeoVault shows that strong offline resistance does not require abstract symbol memorization: three freely selected map points suffice under the evaluated attacker priors.
The broader implication is that cognitive ergonomics deserves the same analytical rigor as computational hardness in the design of key management systems. This is especially pertinent where physical key storage is undesirable and long-term memorability is the overriding design constraint.
7. Conclusions
GeoVault demonstrates that replacing abstract symbol sequences with remembered geographic locations changes the security–usability trade-off in a quantitatively significant way. Users who struggle to reliably store a 12-word BIP-39 phrase can instead anchor a cryptographic key to a small set of familiar places—a spatial recall task that cognitive psychology consistently associates with strong, durable memory retention.
The entropy model quantifies this advantage directly: a single geographically typical location carries 45.7 bits of effective entropy under the global attacker prior—more than twice the effective entropy of a typical human-chosen password. When locations are combined with Argon2id hardening, brute-force cost grows non-linearly with memory cost by exploiting the VRAM bottleneck of GPU hardware, converting a user-side latency of 1.3 s at MiB into an attacker workload orders of magnitude larger; points are required to reach the Super Secure zone.
The key driver is spatial dimensionality. Selecting geographic points places GeoVault firmly in the Super Secure zone ( s) across all evaluated Argon2id memory tiers under the global attacker prior. Against a demographically targeted attacker with city-level knowledge ( bits per point), achieves Human-Scale Secure resistance ( s); with a chaining depth of restores the Super Secure zone at an ≈8 s user-side wait. Argon2id amplifies these margins but cannot compensate for insufficient spatial coverage: single-point configurations remain Insecure regardless of memory cost or chaining depth.
Open directions include empirical user validation of long-term recall accuracy, error-tolerant matching for approximate location recall, extension to indoor and three-dimensional environments, and shared-vault multi-user protocols. On the formal side, tighter characterization of information leaked by public location traces—social-media check-ins, geotagged photographs—would sharpen the targeted attacker model and clarify the conditions under which urban clustering fully eliminates the security margins computed here.
The GPU throughput advantage
(Equation (
30)) serves as a design decomposition: it isolates the KDF’s standalone contribution (
m-axis) from the search space contribution (
n-axis), allowing each to be sized independently. At the reference tier,
MiB,
—meaning the search space must contribute at least a factor of 66 in work beyond the KDF cost alone to reach any given threshold, a condition the
n-axis satisfies exponentially. KDF chaining depth
k raises absolute cost symmetrically but leaves
unchanged, confirming it acts on the same
m-axis as memory cost. Only raising
m beyond GPU VRAM capacity drives
toward unity. The architectural implication is that
does not need to reach 1: once spatial dimensionality is sufficient to absorb the residual throughput gap by orders of magnitude, further reductions in
via extreme memory cost impose usability penalties without meaningful security gains—a principle applicable to human-centered key management schemes beyond geographic location encoding.