A Hybrid Security Framework with Energy-Aware Encryption for Protecting Embedded Systems Against Code Theft

Kıyak, Cemil Baki; Bilge, Hasan Şakir; Yılmaz, Fadi

doi:10.3390/electronics14224395


A Hybrid Security Framework with Energy-Aware Encryption for Protecting Embedded Systems Against Code Theft^†

by Cemil Baki Kıyak ^1,2,*, Hasan Şakir Bilge ^3 and Fadi Yılmaz ^4

^1 Department of Advanced Technologies, Graduate School of Natural and Applied Sciences, Gazi University, Ankara 06500, Türkiye

^2 Hybrid and Electric Vehicles Technology Program, Vocational School, OSTİM Technical University, Ankara 06374, Türkiye

^3 Department of Electrical and Electronic Engineering, Faculty of Engineering, Gazi University, Ankara 06570, Türkiye

^4 Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Ankara Yıldırım Beyazıt University, Ankara 06010, Türkiye

^* Author to whom correspondence should be addressed.

^† The work is derived from the doctoral dissertation “Designing Advanced Encryption Methodology on FPGA”.

Electronics 2025, 14(22), 4395; https://doi.org/10.3390/electronics14224395

Submission received: 8 August 2025 / Revised: 11 September 2025 / Accepted: 16 September 2025 / Published: 11 November 2025


Abstract

This study introduces an energy-aware hybrid security framework that safeguards embedded systems against code theft, closing a critical gap. The approach integrates bitstream encryption, dynamic key generation, and Dynamic Function eXchange (DFX)-based memory obfuscation, yielding a layered hardware–software countermeasure to Read-Only Memory (ROM) scraping, side-channel attacks, and Man-in-the-Middle (MITM) intrusions that eavesdrop on communications over pins, cables, or Printed Circuit Board (PCB) routes. Prototyped on a Xilinx Zynq-7020 System-on-Chip (SoC) and applicable to MicroBlaze-based designs, it derives a fresh Authenticated Encryption with Associated Data (AEAD) key for each record via an Ascon-eXtendable-Output Function (XOF)–based Key Derivation Function (KDF) bound to a device identifier and a rotating slice from a secret pool, while relocating both the pool and selected Block RAM (BRAM)-resident code pages via DFX. This moving-target strategy frustrates ROM scraping, probing, and communication-line eavesdropping, while cryptographic confidentiality and integrity are provided by a lightweight AEAD (Ascon). Hardware evaluation reports cycles/byte, end-to-end latency, and per-packet energy under identical conditions across lightweight AEAD baselines; the framework’s key-derivation and DFX layers are orthogonal to the chosen AEAD. The threat model, field layouts (Nonce/AAD), receiver-side acceptance checks, and quantitative bounds are specified to enable reproducibility. By avoiding online key exchange and keeping long-lived secrets off Programmable Logic (PL)-based external memories while continuously relocating their physical locus, the framework provides a deployable, energy-aware defense in depth against code-theft vectors in FPGA-based systems. Overall, the work provides an original and deployable solution for strengthening the security of commercial products against code theft in embedded environments.

Keywords:

code theft protection; authenticated encryption; Ascon; key derivation; Ascon-XOF; FPGA; Dynamic Function eXchange (DFX); bitstream encryption; side-channel resistance; lightweight cryptography

1. Introduction

Code theft, direct non-volatile readout (also known as ‘ROM scraping’), side-channel attacks, and communication eavesdropping pose a serious problem for commercial products based on embedded-system integrated circuits [1,2,3,4]. These threats jeopardise embedded-code security and intellectual property. ROM scraping refers to the theft of code extracted directly from a ROM through physical access. Although it is feasible on ordinary integrated circuits, it is considerably more difficult on FPGAs, especially for MicroBlaze designs within an SoC architecture.

The default obfuscated structure of the MicroBlaze IP and the bitstream-encryption option offered by Xilinx make code extraction from ROM almost impossible [5,6,7,8]. MicroBlaze, as a soft-core processor embedded in the Programmable Logic (PL) section of the FPGA, stores its code in volatile configuration memory. By nature, this memory is non-linear and obfuscated. The C code compiled for MicroBlaze is converted into this hardware configuration, becoming an integral part of the FPGA fabric; therefore, extracting it through physical techniques such as ROM scraping is extremely difficult. This structure fulfils the reverse-engineering-prevention goal that obfuscated algorithms are meant to serve against the problems defined in the threat model [1,2]. In the Zynq-7020 build of this study, security-critical C-code fragments are placed in AXI-mapped BRAMs inside reconfigurable partitions and are periodically relocated and scrubbed via DFX (PCAP), so the physical locus of both secrets and selected code pages also moves rather than relying solely on MicroBlaze/configuration-memory obfuscation (Figure 1).

Side-channel attacks aim to obtain sensitive data by analysing power consumption or electromagnetic emissions. Bitstream encryption provides resistance to such attacks. Communication eavesdropping, in turn, tries to understand code functionality by analysing the data flow across pins, which can enable indirect code inference, especially through man-in-the-middle methods. The threat model assumes that an attacker may capture the commercial, intellectual-property-class software inside the integrated circuit by scraping the ROM, obtaining keys via side-channel analysis, or analysing communication patterns to infer code behaviour. To counter such threats on resource-constrained systems, lightweight AEAD primitives (e.g., the Ascon-128 family, which NIST has selected for standardisation) and a per-message keying policy are adopted to prioritise low energy and bounded latency in embedded deployments [9,10,11].

Comprehensive studies on embedded-system security in the literature ordinarily focus solely on FPGAs, yet specific solutions that directly address, and effectively prevent, the disclosure of code functionality through communication-line eavesdropping are limited [12]. The relevant work can be grouped under five main headings. Bitstream Protection: Guin [5] tackled counterfeit integrated-circuit threats and proposed bitstream encryption. Obfuscation: Engels [13] assessed the security of logic locking but did not emphasise communication security. IoT Security: Silva [12] investigated lightweight encryption algorithms yet did not address code-functionality protection. Algorithm Hopping: Soliman [14] showed that algorithm hopping together with Dynamic Partial Reconfiguration (DPR, now superseded by DFX) improves security. Physical-Layer Key Generation: recent systems (e.g., MobileKey) derive symmetric keys from wireless-channel reciprocity on mobile devices; these works focus on bootstrapping keys between peers rather than preventing the on-chip code theft or communication-semantics leakage targeted here [15,16,17]. Specific countermeasures that prevent the disclosure of code functionality via communication eavesdropping therefore remain scarce, and this gap is precisely what the proposed method seeks to fill. This study proposes a low-energy method that (i) derives per-message AEAD keys via an Ascon-XOF KDF from a small encrypted pool and a pre-shared root secret bound to the device ID and a timestamp/nonce, and (ii) relocates the encrypted pool and selected code pages in BRAM via DFX as a moving target [18,19,20]. Evaluation is performed on Zynq-7020 hardware (and an Arduino-class solver) using cycles/byte, end-to-end latency, and 12 V/9.6 V energy measurements. Attack simulations are also carried out to assess the resistance of the methodology relative to current well-known AEAD standards.

Consequently, most existing solutions concentrate on bitstream encryption or on optimising the energy and execution time of general cryptographic algorithms, yet the literature lacks sufficient mechanisms that prevent the inference of code behaviour from data captured through communication eavesdropping. Furthermore, academic FPGA-security proposals have been documented as largely theoretical and as failing to meet commercial requirements [8,21]. The method proposed in this study is intended to close this academic–commercial mismatch by addressing IP theft together with commercial requirements such as energy efficiency and practical deployability. No specific threat model or countermeasure directly addressing the extraction of code functionality by communication eavesdropping or ROM scraping has been identified in the literature; current studies typically prioritise energy and time optimisation and therefore overlook this particular threat. Accordingly, practicality, security and energy efficiency are evaluated against representative lightweight AEAD baselines widely used on constrained devices (Ascon-128a, ACORN, TinyJAMBU, JAMBU) under identical measurement conditions [10,22,23,24,25].

Because no directly comparable study targets the combined threat of code-functionality inference via communication eavesdropping together with ROM scraping, the proposed hybrid security framework is evaluated against representative lightweight AEAD baselines widely used on constrained devices (e.g., Ascon-128a, ACORN, TinyJAMBU). The comparison focuses on cycles per byte, end-to-end latency, and energy at 12 V and 9.6 V under identical measurement conditions, while the Hybrid Security Framework (HSF) layer contributes dynamic per-message keying and DFX-based memory relocation orthogonal to the chosen AEAD. Historical PSK-based stream ciphers, such as A5/1, are referenced only as a contrast case to motivate authenticated encryption and to highlight why static-key LFSR designs are unsuitable for this threat model [26].

The hybrid security framework proposed in this study combines bitstream encryption, dynamic key generation, and memory obfuscation through Dynamic Function eXchange (DFX) to counter the threats described above [6,14,21,27]. Bitstream encryption, available as an integrated option in Xilinx Vivado, blocks side-channel attacks on the ROM, whereas the ROM of the MicroBlaze soft processor located inside the SoC Block Design module is already obfuscated and is therefore naturally resistant to physical scraping attacks. In the Zynq-7020 implementation of this study, reliance on any fixed ROM location is also avoided: the security-critical code segments and tables that would otherwise reside in ROM are placed into two DFX-relocatable BRAM regions in the PL and are periodically swapped and scrubbed via PCAP, so even their physical locus keeps moving. Against the communication-eavesdropping threat, the study proposes deriving dynamic per-message keys via an Ascon-XOF KDF from an encrypted pool and a pre-shared root secret (acting as a PSK) combined with the device ID and an optional timestamp/nonce. Although this method is formally similar to schemes that rely on pre-shared keys, such as A5/1 (which, despite all reported vulnerabilities, is still in use in GSM networks), it provides greater security [26].

An A5/1-like structure (stream cipher) was not selected here because its known cryptanalytic weaknesses, 64-bit key length, and non-obfuscated structure make it more likely that the single root key can be obtained directly from RAM [3,26]. In contrast, the proposed method increases the entropy and avoids reliance on a single static key by employing per-message 128-bit AEAD keys derived from (pool + root + device ID + [opt. timestamp/nonce]). In this context, the method introduces an original threat model and a dedicated solution that fill a gap in the literature.

Because the contribution is a hardware–software hybrid that hides communication semantics at run time and relocates both secrets and selected code segments in PL BRAM via DFX, the appropriate baselines are lightweight AEAD ciphers commonly used on embedded targets, not asymmetric key-exchange protocols. Accordingly, the Hybrid Security Framework (HSF) is evaluated as a per-message keying wrapper around standard AEADs (Ascon-128a by default, with ACORN, JAMBU and TinyJAMBU as alternatives), and time and energy are reported on the same setup. Legacy PSK stream ciphers (e.g., A5/1) are cited only to motivate the choice of dynamic, per-message keys over single-root-key schemes that are exposed to known attacks. This framing isolates the novelty (dynamic keying + DFX-relocated BRAM for secrets and code) and aligns the evaluation with the stated threat model. A5/1 employs a 64-bit key and generates session sub-keys from a root key by means of a Linear Feedback Shift Register (LFSR), but it is regarded as weak by modern standards [26]. The proposed method uses per-message 128-bit AEAD keys (Ascon family) and relocates BRAM contents via DFX. DFX does not increase cryptographic key entropy; it expands the attacker’s physical search space.

One of the important sub-components in the proposed hybrid method is the use of DFX, which conceals the physical location of the secret pool and selected code pages by continually relocating RAM blocks and addresses. In this process, RAM addresses are reconfigured randomly and meaningless data are inserted into empty areas, making it difficult for an attacker to separate genuine data from decoys. Thus, the proposed method not only enlarges the attacker’s physical search space but also impedes correlating communication patterns with code behaviour, thereby preventing code theft [8]. In practice, relocating BRAM-resident code pages together with secret material further breaks stable leakage templates and frustrates physical scraping or probing.

In the design of the proposed method, the energy-efficiency advantages of pre-shared keys (PSKs) were taken into account. This consideration is critical for resource-constrained embedded systems and similar IoT devices. Protocols that rely on asymmetric key exchange, such as Diffie–Hellman, are unsuitable for energy-critical embedded systems owing to their high computational cost [12,28].

The method offers an alternative solution to the problem defined in the aforementioned threat model by generating dynamic keys without requiring real-time key exchange, with reduced energy consumption and latency as additional design goals. For each message, a random element is drawn from the encrypted pool and combined with the root key and the device ID; an Ascon-XOF KDF then derives the per-message AEAD key from these inputs.

Threat model. An adversary is assumed who can probe off-chip buses (DDR/QSPI/AXI), attempt physical readout of on-chip arrays, and passively observe I/O to infer program behaviour. In response, only encrypted secret pools transit AXI, and BRAM instances holding secrets are periodically destroyed and recreated via DFX to desynchronise physical locality from logical content. Relocating BRAM-resident code pages together with the secret pool extends this desynchronisation to code and frustrates scraping/probing templates.

The proposed method was intentionally made complex so that, in addition to being low-cost in terms of energy and time, it would resist the hardware attacks described in the threat model. Consequently, its complexity was increased with the aim of contributing to hardware obfuscation in that context [4,6,7]. Here, the role of obfuscation is to enhance security by increasing system complexity, thereby enriching Kerckhoffs’s principle—that security should rely on the secrecy of the key rather than the secrecy of the algorithm—in a modern context. Responding to the unique security needs faced by FPGA-based systems, this innovative method supplements traditional approaches with implementation-level obfuscation. It also takes into account Shannon’s [29] dictum that any system can be broken given sufficient time and significantly hinders an attacker’s efforts by making reverse engineering of the obfuscated structure more difficult. For this reason, the method additionally aims to extend the attacker’s expenditure of time and resources. Within this context, DFX provides protection against statistical analysis by continually changing RAM addresses and inserting meaningless data [8]. Therefore, although lightweight, this purpose-built hybrid structure—evaluated from multiple perspectives and without compromising security—provides sufficient complexity to prevent even indirect inference of code behaviour, because even if a key is broken only a single message can be captured.

An optional provisioning phase can pre-establish a root secret and parameters under bitstream encryption, enabling both endpoints to deterministically derive the same encrypted pool from a TRNG-generated seed/salt via Ascon-XOF, without transferring static keys [30,31,32]. A timestamp-trial variant may be used, but this option is orthogonal to the evaluated threat model and deferred to future work. The provisioning option preserves low energy and latency by avoiding online key exchange on constrained devices. Importantly, DFX enlarges the physical search space against scraping/probing, whereas the cryptographic key entropy remains 128 bits.

Although this issue lies outside the threat model, the study also proposes an optional mechanism. This mechanism aims to provide an infrastructure for future work that tackles weaknesses arising from the “key-embedding” approach and from other encryption gaps noted in the literature. Accordingly, the provisioning phase is discussed but kept outside the formal scope of the solution proposed for the main threat model. Nevertheless, the study flags this separate issue for future research by offering a preliminary solution path. For this reason, theoretical or experimental comparison topics have not been addressed for this phase.

As a result, this study fills an important gap in the current literature by presenting an energy-aware and practical solution against code theft in FPGA-based systems [8,12,21]. Its contributions are (i) the definition of a specific threat model for extracting code functionality by eavesdropping on communication lines and the presentation of a hybrid security solution against it; (ii) the justification, using examples such as A5/1, of the suitability of pre-shared keys for energy efficiency and practical applicability in IoT environments [26]; (iii) the detailed analysis, in the light of Kerckhoffs’s principle, of the contribution of DFX-based obfuscation techniques to real-world security by increasing attack complexity and time even if the algorithm is known [6,8,21]; and (iv) the demonstration that the additional energy and latency costs incurred by the proposed method, when compared with representative lightweight AEAD baselines on the same hardware, are at an acceptable level for a practical solution [33].

Note. The open-source comparison of the proposed KDF method against known AEAD baselines is publicly available to facilitate reproducibility and future research: https://github.com/tkopter/Proposd_HSF (accessed on 15 September 2025).

2. State of the Art

Field-programmable gate arrays (FPGAs) have become indispensable in embedded computing systems ranging from consumer electronics to critical infrastructure. Their reconfigurable nature enables rapid development and deployment but also exposes designs to a spectrum of threats such as intellectual property theft, side-channel analysis, fault injection, remote Trojan insertion and protocol-level attacks. Conventional countermeasures focus on bitstream encryption or static logic locking, yet the last decade has seen a proliferation of attacks demonstrating that confidentiality alone is insufficient. At the same time, resource-constrained devices such as Internet-of-Things (IoT) nodes have pushed researchers to develop lightweight cryptography and energy-aware security modules. This section presents a comprehensive survey of recent work on code protection and dynamic partial reconfiguration, emphasising energy efficiency and practical deployment.

2.1. Bitstream Attacks and Code Protection

Despite vendor-supplied AES-based bitstream encryption, the secrecy of configuration data has been repeatedly compromised. Beginning in 2011, Moradi and co-authors showed that the built-in bitstream decryption engine in Xilinx Virtex and Altera Stratix FPGAs leaks the encryption key through side-channel analysis, enabling complete key recovery without breaking the AES algorithm [34,35]. Later works generalised the attack to newer devices and even demonstrated fault-injection techniques to extract keys from encrypted bitstreams [36]. These results led to the realisation that standard bitstream encryption alone offers limited protection. A seminal warning came from Johnson et al., who exploited remote dynamic partial reconfiguration (DPR) over Ethernet to insert hardware Trojans into a running FPGA design, thereby leaking AES keys and tampering with embedded processors [37]. Their work highlighted that dynamic reconfiguration, if unprotected, becomes an attack vector.

To counteract these threats, researchers have proposed binding bitstreams to specific devices and obscuring logic at the circuit level. Maes et al. introduced physically unclonable function-based key generation, where the decryption key is derived from silicon process variations rather than stored in non-volatile memory [38]. Van Herrewege and colleagues improved the reliability of such PUF-based schemes for Xilinx 7-series FPGAs [39]. Obfuscation is another line of defence: Karam et al. proposed mapping truth tables into unused “dark silicon” regions of the FPGA to create a key-dependent bitstream that remains functional only when the correct key is applied at run-time [7]. Engels and Shafique analysed logic locking and concluded that many schemes succumb to SAT or structural analysis [13], motivating the need for more robust mechanisms. Recently, Stolz et al. integrated hardware and software obfuscation: their LifeLine framework splits a cryptographic algorithm across FPGA fabric and embedded processor and assembles it at run-time using DPR, thereby thwarting static analysis [8].

In parallel, the academic community has experimented with automated Trojan insertion and bitstream tampering. Ender et al. demonstrated the first full break of Xilinx 7-series bitstream encryption, dubbed the “Starbleed” attack, which recovers both key and data by abusing undocumented configuration commands [40]. Kataria et al. showed that flipping just a few bits in the bitstream of a Cisco Trust Anchor module disables the integrity check and allows arbitrary modifications [41]. Automated toolchains now exist to inject hardware Trojans into arbitrary bitstreams [42], further illustrating that obfuscation and run-time checks are necessary.

2.2. Dynamic Reconfiguration and Lightweight Cryptography

Partial reconfiguration has been used to improve security by changing the hardware configuration at run-time. Soliman et al. introduced an “algorithm hopping” security module that randomly swaps among five authenticated-encryption ciphers using DPR [14]. By multiplexing one cipher core instead of instantiating all five, their design reduces the static power consumption by roughly 80% and occupies only 58% of the LUTs of a monolithic implementation. Samir et al. took a complementary approach by loading ciphers of different complexity according to available battery power: their energy-adaptive HSM saves about 60% of LUT resources and caps dynamic power at 10 mW while still providing authenticated encryption [43]. Wei et al. applied DPR to physical unclonable functions, switching between three different ring-oscillator PUF layouts on the fly to increase entropy and resilience against modelling attacks [44]. Sunkavilli et al. proposed DPReDO, a dynamic obfuscation scheme that periodically reconfigures small circuit segments to hinder hardware Trojans; they reported up to 80% reduction in Trojan success rate with less than 3% area overhead [45].

Beyond reconfiguration, a large body of work explores lightweight cryptography for FPGAs. Sasdrich et al. implemented a moving-target version of the PRESENT block cipher whose architecture changes each clock cycle to resist side-channel attacks [46]. Faraj and Gebotys exploited photonic emissions to extract secrets from SRAM and emphasised the need for photonic shielding in cryptographic FPGAs [3]. Bommana et al. combined DPR with deep-learning-based side-channel countermeasures to achieve adaptive protection [47]. Several recent papers examine chaotic and biometric schemes on FPGAs, such as Lorentz-chaos ciphers [48], EEG-driven pseudo-random generators [49], quantum-chaos hybrids [50], multi-dimensional chaotic maps [51], and dynamic S-box designs [52]. Ciylan et al. presented a systolic-array chaos convolution cipher on a Virtex-7 platform, measuring 55% LUT utilisation and 280 mW power consumption [53]. The diversity of these proposals underscores the search for lightweight yet secure cryptography suitable for resource-constrained FPGAs. For completeness, physical-layer key-generation systems (e.g., MobileKey and recent surveys) address peer key bootstrapping and are orthogonal to our goal of preventing on-chip code theft and communication-semantics leakage [15,16,17].

Despite the progress above, existing works typically address either energy efficiency or bitstream confidentiality but rarely both in the context of an integrated system. Legacy stream ciphers such as A5/1 are now regarded as insecure in modern threat models because they employ pre-shared 64-bit keys and linear feedback shift registers, which expose them to time-memory trade-off and correlation attacks [26,54,55]. The proposed Hybrid Security Framework instead combines lightweight authenticated encryption (AEAD) with per-message key derivation via an Ascon-XOF KDF (from an encrypted pool bound to a pre-shared root secret, device ID, and an optional timestamp/nonce) [10], and memory obfuscation via Dynamic Function eXchange (DFX) to provide defence in depth.

No approach is found to explicitly target the combined threat of inferring program functionality from eavesdropped I/O while resisting physical readout of on-chip contents under embedded energy/latency constraints. This gap motivates pairing a lightweight AEAD with per-message keys and run-time relocation of security-critical BRAM contents via DFX, which enlarges the physical search space without a need to claim extra key entropy. The following tables compare the energy consumption of the proposed approach with representative prior works, summarise the gap in the literature and highlight the key trade-offs. Table 1 summarises representative works from the past decades, outlining the main limitations that motivate the HSF approach.

3. Materials and Methods

This section describes the methods used to implement a hybrid security framework that couples per-message authenticated encryption with run-time relocation of secret storage and selected code pages in AXI-mapped BRAM via Dynamic Function eXchange (DFX). The description focuses on the encryption algorithm hardware design; measurement and benchmarking details are reported elsewhere.

3.1. Hybrid Security Framework (HSF): Design and AEAD Interface

A 256-bit root secret ($S_{ROOT}$), a device identifier (dev_id, 32-bit, big-endian on the wire), and a pool of 128-bit slices ($\text{POOL}[\,]$) reside in AXI-mapped BRAM. Pool entries are 128-bit windows over the 256-bit root at non-sequential offsets; the $\text{idx} \to$ slice mapping is kept in BRAM and, together with selected code pages, is periodically relocated by Dynamic Function eXchange (DFX), so the physical locus of both secrets and code becomes a moving target. Authenticated encryption uses ASCON-128a (v1.2); policy-controlled key derivation is performed via ASCON-XOF128. Nonces and AAD are public but integrity-bound; acceptance is tag-only.

Keying and Domain Separation (KDF Policies). ASCON-XOF128 is modeled as a PRF/RO keyed by the 256-bit $S_{ROOT}$ over public context. Two deployment policies are supported; both apply explicit domain separation via a protocol string dom [18,19,20].

Policy A (per-slice key).
$K_{i} = \text{ASCON-XOF128}(\text{"KDF|ASCON|v1"} \parallel S_{ROOT} \parallel \text{dev\_id} \parallel \text{POOL}[\text{idx}])$
Keys are per-device/per-slice; nonce uniqueness per key remains mandatory. A compromise of $K_{i}$ affects packets using the same $(\text{dev\_id}, \text{idx})$ until rekey or slice rotation.
Policy B (per-message key, optional).
$K_{i} = \text{ASCON-XOF128}(\text{"KDF|ASCON|v1"} \parallel S_{ROOT} \parallel \text{dev\_id} \parallel \text{POOL}[\text{idx}] \parallel \text{epoch} \parallel \text{msg})$
Keys are per-packet; a compromise of one $K_{i}$ does not help on other packets, even with full knowledge of $(\text{dev\_id}, \text{idx}, \text{epoch}, \text{msg})$.

In both policies, $S_{ROOT}$ never leaves the device, and the protocol string dom provides cross-protocol domain separation. Under Policy B, $(\text{epoch}, \text{msg})$ additionally define a per-packet domain [10,18,19]; therefore exposure of one $K_{i}$ does not help derive any $K_{j \neq i}$ even with full knowledge of $(\text{dev\_id}, \text{idx}, \text{epoch}, \text{msg})$.
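The two policies can be sketched as follows. This is a minimal illustration only: SHAKE128 from the Python standard library stands in for ASCON-XOF128, and the function names, the 4-byte epoch and 8-byte msg encodings, and the plain concatenation order are assumptions chosen to match the nonce field widths, not details confirmed by the text.

```python
import hashlib
import struct
from typing import Optional

DOM = b"KDF|ASCON|v1"  # protocol string "dom" quoted in the policy formulas


def xof128(data: bytes, out_len: int = 16) -> bytes:
    """Stand-in XOF: stdlib SHAKE128, NOT ASCON-XOF128 (assumption)."""
    return hashlib.shake_128(data).digest(out_len)


def derive_key(s_root: bytes, dev_id: int, pool_slice: bytes,
               epoch: Optional[int] = None,
               msg: Optional[int] = None) -> bytes:
    """Policy A when epoch/msg are omitted, Policy B when both are given."""
    if len(s_root) != 32 or len(pool_slice) != 16:
        raise ValueError("expected a 256-bit root and a 128-bit slice")
    if (epoch is None) != (msg is None):
        raise ValueError("Policy B needs both epoch and msg")
    # All fields are fixed-width, so the concatenation is unambiguous.
    material = DOM + s_root + struct.pack(">I", dev_id) + pool_slice
    if epoch is not None:
        material += struct.pack(">IQ", epoch, msg)  # per-packet domain
    return xof128(material)
```

Under Policy B, two packets that differ only in msg yield unrelated 128-bit keys, which is the property that confines a key compromise to a single packet.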

Nonce and AAD Construction. The public nonce is 16 bytes and packs dev|epoch|msg as big-endian fields:

$\text{nonce} = \text{pack128}(\text{dev\_id}[4], \text{epoch\_ctr}[4], \text{msg\_ctr}[8])$

where reuse is prevented by a monotone policy on $(\text{epoch\_ctr}, \text{msg\_ctr})$ [10,62].

Operational rules are as follows: (i) increment $\text{epoch}$ on reboot; (ii) keep $\text{msg}$ monotone; (iii) rekey (new $\text{idx}$ or a new $S_{ROOT}$ derivative) well before counter wrap. Uniqueness per key is mandatory for AEAD security.
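The nonce layout and the three operational rules can be illustrated with a short sketch. The class name, the persisted `stored_epoch` argument, and the rekey margin are hypothetical choices made for this example; only the 4/4/8-byte big-endian layout comes from the text.

```python
import struct


def pack_nonce(dev_id: int, epoch_ctr: int, msg_ctr: int) -> bytes:
    """16-byte nonce: dev_id[4] | epoch_ctr[4] | msg_ctr[8], big-endian."""
    return struct.pack(">IIQ", dev_id, epoch_ctr, msg_ctr)


class NonceCounter:
    """Hypothetical helper that applies rules (i)-(iii)."""

    # Assumed safety margin: demand a rekey well before the 64-bit wrap.
    MSG_LIMIT = 2**64 - 2**32

    def __init__(self, dev_id: int, stored_epoch: int):
        if stored_epoch + 1 >= 2**32:
            raise RuntimeError("epoch exhausted: rekey required")
        self.dev_id = dev_id
        self.epoch_ctr = stored_epoch + 1  # rule (i): bump on every reboot
        self.msg_ctr = 0

    def next_nonce(self) -> bytes:
        if self.msg_ctr >= self.MSG_LIMIT:
            # rule (iii): rekey instead of letting the counter wrap
            raise RuntimeError("rekey required before counter wrap")
        nonce = pack_nonce(self.dev_id, self.epoch_ctr, self.msg_ctr)
        self.msg_ctr += 1  # rule (ii): strictly increasing per epoch
        return nonce
```

Because the epoch is bumped before any message is sent, a device that loses its message counter on power-down still cannot repeat a (epoch_ctr, msg_ctr) pair, provided the epoch itself is stored persistently.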

Rationale. The dev field binds the endpoints and enables rejection of packets from the wrong device; epoch prevents counter collisions after reboot; msg enforces monotonicity that provides per-epoch freshness and aids transport synchronisation. These fields are not secret; carrying them in the clear does not introduce a vulnerability, because acceptance depends solely on verification of the authentication tag under the correct key. Associated data (AAD) is integrity-protected by the AEAD tag and uses either a 15-byte layout ver|dev|idx|msg|feat or a 23-byte extension that appends an optional ts field. Here, ver encodes protocol-version compatibility; dev enforces endpoint matching; idx indicates which slice is used; msg supports synchronisation with the transport layer and controls replay; and feat carries application-specific policy bits. When present, the optional ts field enables timestamp validation. The AAD is not encrypted; however, it is cryptographically bound to the AEAD tag, so any bit flip causes verification to fail (INT-CTXT) [9]. When Policy B is selected, $(\text{epoch}, \text{msg})$ also enter the KDF for per-packet domain separation; under Policy A they do not.
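The AAD layouts can be sketched as fixed-width records. The individual field widths below (ver 1, dev 4, idx 1, msg 8, feat 1, ts 8 bytes) are assumptions: they are consistent with the stated 15-byte and 23-byte totals and with the nonce field sizes, but the text does not give them explicitly. The function names are likewise hypothetical.

```python
import struct
from typing import Optional

# Assumed widths: ver[1] | dev[4] | idx[1] | msg[8] | feat[1] (+ ts[8])
AAD_BASE = struct.Struct(">BIBQB")    # 15 bytes
AAD_TS = struct.Struct(">BIBQBQ")     # 23 bytes
FIELDS = ("ver", "dev", "idx", "msg", "feat", "ts")


def pack_aad(ver: int, dev: int, idx: int, msg: int, feat: int,
             ts: Optional[int] = None) -> bytes:
    """Serialise the AAD; the ts field is appended only when provided."""
    if ts is None:
        return AAD_BASE.pack(ver, dev, idx, msg, feat)
    return AAD_TS.pack(ver, dev, idx, msg, feat, ts)


def unpack_aad(aad: bytes) -> dict:
    """Parse either layout; reject any other length."""
    if len(aad) == AAD_BASE.size:
        values = AAD_BASE.unpack(aad) + (None,)
    elif len(aad) == AAD_TS.size:
        values = AAD_TS.unpack(aad)
    else:
        raise ValueError("AAD must be 15 or 23 bytes")
    return dict(zip(FIELDS, values))
```

Parsing only tells the receiver what the sender claims; the fields become trustworthy only after the AEAD tag over the same bytes has verified.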

Slice Pool. The slices over $S_{ROOT}$ are 128-bit sliding windows of the 256-bit string; their starting offsets are drawn from the range $s \in [1, 129]$, and the selection order is non-monotonic (e.g., the fifth key may use the window $[100..227]$ when $s = 100$). The idx field in the AAD is merely a label indicating which window is used; because the window's contents remain secret, exposing idx does not create a vulnerability. Even if the same idx is reused, AEAD security is preserved by nonce uniqueness; the residual risk is a possible concentration of key leakage on a single idx, which is mitigated by random idx selection and DFX. Because the windows sit at non-sequential offsets (e.g., $[1:128], [100:227], \dots$) and the policy avoids a fixed ordering, an observer cannot map public indices to stable root sub-ranges.
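The windowing described above can be illustrated with a minimal Python sketch. It assumes that bit position 1 is the most significant bit of $S_{ROOT}$ and that offsets are drawn without repetition from a system random source; neither convention is stated in the text, and the helper names `slice_window` and `build_pool` are hypothetical.

```python
import secrets

ROOT_BITS = 256    # length of S_ROOT
SLICE_BITS = 128   # length of each pool slice


def slice_window(s_root: bytes, s: int) -> bytes:
    """Return the window [s .. s+127] of S_ROOT (1-indexed bits, bit 1 = MSB; assumed)."""
    if len(s_root) * 8 != ROOT_BITS:
        raise ValueError("S_ROOT must be 256 bits")
    if not 1 <= s <= ROOT_BITS - SLICE_BITS + 1:   # valid offsets are 1..129
        raise ValueError("offset out of range")
    root_int = int.from_bytes(s_root, "big")
    shift = ROOT_BITS - (s - 1) - SLICE_BITS       # bits to discard on the right
    window = (root_int >> shift) & ((1 << SLICE_BITS) - 1)
    return window.to_bytes(SLICE_BITS // 8, "big")


def build_pool(s_root: bytes, size: int) -> list:
    """Build a pool from `size` distinct offsets in random (non-monotonic) order."""
    rng = secrets.SystemRandom()
    offsets = rng.sample(range(1, 130), size)      # hypothetical selection policy
    return [slice_window(s_root, s) for s in offsets]
```

With this convention, offset 1 yields the first 16 bytes of the root and offset 129 the last 16 bytes; the public idx is then simply the position of a slice in the returned list.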

Initialization. Initialization loads $S_{ROOT}$, dev_id, and POOL[] into BRAM; sets (epoch_ctr, msg_ctr); and enables DFX relocation/scrubbing for the BRAM regions that host the secret pool and selected code pages. On reboot, epoch_ctr is incremented to prevent nonce reuse. At startup, the DFX controller allocates two BRAM regions for secrets and selected code and enables periodic relocation.

Steady-state. For each message, build the nonce from (dev_id, epoch_ctr, msg_ctr), derive $K_{i}$ according to Policy A or Policy B using $(S_{ROOT}, dev_id, POOL[idx] [∥ epoch ∥ msg])$, compute $(C, tag) = ASCON-128a_Enc(K_{i}, nonce, AAD, M)$, and transmit the record. The receiver enforces the checks in Section 3.1.1 and accepts only on successful tag verification. Reusing the same idx in consecutive messages does not degrade security as long as nonces are unique; under Policy A a compromised key affects packets with the same (dev_id, idx) until rekey, whereas under Policy B compromise is confined to a single packet.

3.1.1. Receiver Side Acceptance Checks

A packet is accepted if all conditions [L1]–[L9] hold; otherwise it is rejected. The items below mirror the concise checklist in Table 2.

L1: Lengths valid: $|nonce| = 16$, $|tag| = 16$, $|AAD| \in \{15, 23\}$. (format sanity)
L2: Index & device consistency: $idx < |POOL|$; $dev_{AAD} = dev_{nonce}$. (single source of truth)
L3: Message consistency: $msg_{AAD} = msg_{nonce}$. (single source of truth)
L4: Version/feature policy: policy bits ver/feat, AAD layout valid. (compatibility gating)
L5: Anti-replay: $(epoch, msg)$ is fresh under the policy (monotone or sliding window). (replay resistance)
L6: Optional timestamp policy: if $ts$ is present, enforce $ts \geq ts_{last}$ or a window bound. (audit/window when enabled)
L7: KDF derive (base): $K_{i} = ASCON-XOF128("KDF|ASCON|v1" ∥ S_{ROOT} ∥ dev_id ∥ POOL[idx])[0..127]$.
L8: KDF (optional separation): append $epoch ∥ msg$ if the policy requires per-message domain separation.
L9: AEAD verify: $ASCON-128a_Dec(K_{i}, nonce, AAD, C, tag)$ succeeds. (INT-CTXT)

Table 2. Receiver checks (concise).

Step	Condition (Reject on Failure)
L1	$|nonce| = 16$, $|tag| = 16$, $|AAD| \in \{15, 23\}$
L2	$idx < |POOL|$; $dev_{AAD} = dev_{nonce}$
L3	$msg_{AAD} = msg_{nonce}$
L4	Policy bits: `ver`/`feat` and AAD layout valid
L5	Anti-replay: $(epoch, msg)$ fresh (monotone or window)
L6	Optional timestamp: if $ts$ present, enforce $ts \geq ts_{last}$ or window bound
L7	$K_{i} = ASCON-XOF128("KDF|ASCON|v1" ∥ S_{ROOT} ∥ dev_id ∥ POOL[idx])[0..127]$
L8	(Optional) Append $epoch ∥ msg$ to KDF input for per-message domain separation (Policy B)
L9	$ASCON-128a_Dec(K_{i}, nonce, AAD, C, tag)$ succeeds

See Table 3 for full field semantics.
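The structural checks L1–L4 can be sketched in Python before any cryptographic work is done. The byte widths used here (4-byte dev, 4-byte epoch, 8-byte msg, 1-byte ver/idx/feat, 8-byte ts) are assumptions chosen only so that the nonce is 16 bytes and the AAD is 15 or 23 bytes; the actual field sizes are those given in Table 3, and `check_header` is a hypothetical helper name.

```python
import struct

NONCE_FMT = ">IIQ"    # assumed: dev (4 B) | epoch (4 B) | msg (8 B) = 16 B
AAD_FMT = ">BIBQB"    # assumed: ver (1) | dev (4) | idx (1) | msg (8) | feat (1) = 15 B
TS_FMT = ">Q"         # assumed: optional 8-byte ts, giving the 23-byte layout


def check_header(nonce: bytes, aad: bytes, tag: bytes, pool_size: int,
                 supported_ver: int = 1):
    """Run checks L1-L4; return the parsed fields, or None to reject."""
    # L1: length sanity
    if len(nonce) != 16 or len(tag) != 16 or len(aad) not in (15, 23):
        return None
    dev_n, epoch, msg_n = struct.unpack(NONCE_FMT, nonce)
    ver, dev_a, idx, msg_a, feat = struct.unpack(AAD_FMT, aad[:15])
    ts = struct.unpack(TS_FMT, aad[15:])[0] if len(aad) == 23 else None
    # L2: index in range, device fields agree
    if idx >= pool_size or dev_a != dev_n:
        return None
    # L3: message counters agree
    if msg_a != msg_n:
        return None
    # L4: version gating (feature-bit policy omitted in this sketch)
    if ver != supported_ver:
        return None
    return {"dev": dev_n, "epoch": epoch, "msg": msg_n, "idx": idx,
            "ver": ver, "feat": feat, "ts": ts}
```

Rejecting malformed headers first means that key derivation and tag verification (L7–L9) only run on packets whose public fields are self-consistent.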

Security note. dev/idx/msg/epoch are public by design; secrecy is provided by $K_{i}$ and integrity by the tag [30,31,32]. Leaking idx or dev does not weaken confidentiality because $S_{ROOT}$ and POOL[idx] remain unknown, while $(epoch, msg)$ ensure per-packet nonce uniqueness and, under Policy B, per-packet key separation. Under Policy A (per-slice), compromising a derived key impacts packets using the same (dev_id, idx) until rekey or slice rotation. Under Policy B (per-message), including $(epoch, msg)$ in the KDF confines a compromise to that packet. In both policies, nonce uniqueness per key remains mandatory.
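The difference between the two policies can be made concrete with a short Python sketch. ASCON-XOF128 is not available in the Python standard library, so SHAKE-128 from `hashlib` is used here purely as a stand-in XOF; the derived bytes therefore do not match the real KDF output. The 4-byte epoch and 8-byte msg encodings are also assumptions.

```python
import hashlib

LABEL = b"KDF|ASCON|v1"   # domain-separation label from the KDF definition


def derive_key(s_root: bytes, dev_id: bytes, pool_slice: bytes,
               epoch=None, msg=None) -> bytes:
    """Derive a 128-bit key; pass epoch and msg to select Policy B.

    SHAKE-128 stands in for ASCON-XOF128 (assumption for illustration only).
    """
    data = LABEL + s_root + dev_id + pool_slice
    if epoch is not None and msg is not None:
        # Policy B: per-message domain separation (field widths assumed)
        data += epoch.to_bytes(4, "big") + msg.to_bytes(8, "big")
    return hashlib.shake_128(data).digest(16)   # truncate to bits [0..127]
```

Under Policy A the same (dev_id, idx) always yields the same key, whereas under Policy B changing msg alone yields a different key, which is what confines a key compromise to one packet.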

3.1.2. Operational Procedures

Ciphertexts and tags are produced by ASCON-128a. Acceptance is exclusively by tag verification; no plaintext markers are used. The probability of tag forgery is approximately $2^{-128}$; to mitigate application-level timing side channels, tag comparison is implemented in constant time [10]. Damage containment follows Policy A/B as defined in the KDF Policies subsection. Figure 2 summarises the KDF inputs, Nonce/AAD construction, receiver-side checks, and the DFX-assisted BRAM relocation of both secret pools and selected code pages.
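As an illustration of the constant-time comparison mentioned above, a minimal Python sketch could rely on `hmac.compare_digest` from the standard library. The wrapper name `tags_equal` is hypothetical, and the evaluated firmware is written in C, so this is not the authors' implementation.

```python
import hmac


def tags_equal(expected: bytes, received: bytes) -> bool:
    """Compare two tags without an early exit on the first differing byte."""
    return hmac.compare_digest(expected, received)
```

A naive `expected == received` may return as soon as a byte differs, so its running time can reveal how many leading bytes of a forged tag were correct; a constant-time comparison avoids that signal.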

DFX does not increase cryptographic key entropy, which remains defined by the 128-bit key length; rather, DFX relocation and BRAM scrubbing enlarge the attacker's physical search space against scraping/probing attacks and destabilize leakage templates. This effect is orthogonal to the cryptographic keyspace. Pool size and index selection are parameters: a larger pool reduces the probability of accidentally reusing the same slice; the index idx is public in the AAD, whereas the slice content remains secret. Only the idx label and the AAD/nonce headers are visible to an eavesdropper; the idx → slice mapping is hidden within BRAM and is periodically relocated via DFX, so observing idx on the clear channel does not permit inference of the slice contents or of $S_{ROOT}$.

Parameter roles. The tag authenticates the ciphertext and verifies its integrity; the AAD is transmitted in the clear but, being cryptographically bound to the tag, cannot be modified without detection; and the KDF (ASCON-XOF128) derives a 128-bit session key $K_{i}$ for each packet from $S_{ROOT}$, dev_id, and the secret POOL[idx], according to Policy A or Policy B [9]. Thus confidentiality (IND-CPA) and integrity (INT-CTXT) are ensured.

Encryption. Given plaintext M and header fields (idx, epoch_ctr, msg_ctr, ver, feat[, ts]): (i) read POOL[idx] and dev_id from BRAM; (ii) derive $K_{i}$ by ASCON-XOF128 as above; (iii) build the nonce and AAD; (iv) compute $(C, tag) = ASCON-128a_Enc(K_{i}, nonce, AAD, M)$; (v) emit a single UART line of the form

PKT;ver=1;nonce_hex=…;aad_hex=…;ct_hex=…;tag_hex=…
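The UART record format lends itself to a small formatter/parser pair. The following Python sketch assumes lowercase hex without separators and tolerates a trailing semicolon; these details, and the helper names `format_pkt` and `parse_pkt`, are assumptions rather than part of the specification.

```python
def format_pkt(nonce: bytes, aad: bytes, ct: bytes, tag: bytes, ver: int = 1) -> str:
    """Serialise one record as a single semicolon-separated UART line."""
    return (f"PKT;ver={ver};nonce_hex={nonce.hex()};aad_hex={aad.hex()};"
            f"ct_hex={ct.hex()};tag_hex={tag.hex()}")


def parse_pkt(line: str) -> dict:
    """Parse a PKT line back into binary fields; raises on malformed input."""
    parts = line.strip().split(";")
    if parts[0] != "PKT":
        raise ValueError("not a PKT line")
    # key=value pairs; empty parts (e.g. after a trailing ';') are skipped
    fields = dict(p.split("=", 1) for p in parts[1:] if p)
    return {
        "ver": int(fields["ver"]),
        "nonce": bytes.fromhex(fields["nonce_hex"]),
        "aad": bytes.fromhex(fields["aad_hex"]),
        "ct": bytes.fromhex(fields["ct_hex"]),
        "tag": bytes.fromhex(fields["tag_hex"]),
    }
```

Keeping the record on one text line makes it easy to log and replay captures on the receiver side, at the cost of doubling the payload size through hex encoding.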

Decryption. Upon receipt, the receiver: (i) checks the lengths and layout of nonce/AAD/tag; (ii) verifies $idx < |POOL|$ and that the dev and msg fields in the AAD equal those in the nonce; (iii) enforces anti-replay on (epoch_ctr, msg_ctr); (iv) derives $K_{i}$ with ASCON-XOF128 using $(S_{ROOT}, dev_id, POOL[idx])$, with $epoch ∥ msg$ appended under Policy B; (v) accepts only if $ASCON-128a_Dec(K_{i}, nonce, AAD, C, tag)$ returns success, then delivers M and updates the replay window. Failed verifications are logged and the (epoch, msg) window is not advanced; for accepted packets, the window is advanced.
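The replay handling described above separates a freshness test (before verification) from a state update (only after the tag verifies). A minimal Python sketch of that split follows; the class name `ReplayGuard`, the tuple ordering on (epoch, msg), and the window semantics are assumptions, since the text only names the two policies (monotone or sliding window).

```python
class ReplayGuard:
    """Track accepted (epoch, msg) pairs; window=0 gives the strict monotone policy."""

    def __init__(self, window: int = 0):
        self.window = window
        self.high = None      # highest accepted (epoch, msg)
        self.seen = set()     # accepted msg values still inside the window

    def is_fresh(self, epoch: int, msg: int) -> bool:
        if self.high is None or (epoch, msg) > self.high:
            return True
        h_epoch, h_msg = self.high
        in_window = (self.window > 0 and epoch == h_epoch
                     and h_msg - msg <= self.window)
        return in_window and msg not in self.seen

    def commit(self, epoch: int, msg: int) -> None:
        """Call only after successful tag verification."""
        if self.high is None or (epoch, msg) > self.high:
            if self.high is not None and epoch != self.high[0]:
                self.seen.clear()           # new epoch: restart the window
            self.high = (epoch, msg)
        self.seen.add(msg)
        self.seen = {m for m in self.seen if self.high[1] - m <= self.window}
```

Because `commit` is never called for a packet that fails verification, a forged packet with a large counter cannot advance the window and lock out legitimate traffic.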

Field semantics and security rationale. The packet exposes a fixed set of public fields (Nonce, AAD) and hides secret material (key, ROOT, slice). Table 3 summarises each symbol used in Figure 2, its scope/size, provenance, and the exact security/verification role; _N denotes “from Nonce”, and _AAD denotes “from AAD”.

Algorithm 1 HSF–ASCON Encrypt/Decrypt (Pseudocode)

function HSF_Encrypt($M, idx, epoch, msg, ver, feat[, ts]$)
  $K \leftarrow ASCON-XOF128("KDF|ASCON|v1" ∥ S_{ROOT} ∥ dev_id ∥ POOL[idx])[0..127]$
  % Optional (Policy B): append $epoch ∥ msg$ to the XOF input before truncation for per-message separation
  $nonce \leftarrow pack128(dev_id, epoch, msg)$
  $AAD \leftarrow ver | dev | idx | msg | feat [| ts]$
  $(C, tag) \leftarrow ASCON-128a_Enc(K, nonce, AAD, M)$
  return PKT;ver=1;nonce_hex=…;aad_hex=…;ct_hex=…;tag_hex=…
end function
function HSF_Decrypt($PKT$)
  parse $nonce, AAD, C, tag$; require $|nonce| = 16$, $|AAD| \in \{15, 23\}$, $|tag| = 16$
  require $idx < |POOL|$, $dev_{AAD} = dev_{nonce}$, and $msg_{AAD} = msg_{nonce}$; anti-replay check on $(epoch, msg)$
  $K \leftarrow ASCON-XOF128("KDF|ASCON|v1" ∥ S_{ROOT} ∥ dev_id ∥ POOL[idx])[0..127]$
  % Optional (Policy B): append $epoch ∥ msg$ to the XOF input, as in HSF_Encrypt
  $(ok, M) \leftarrow ASCON-128a_Dec(K, nonce, AAD, C, tag)$
  return M if $ok$ else Reject
end function

Optional provisioning. An optional provisioning phase may establish a seed/salt under bitstream protection so that both endpoints deterministically derive the same POOL[] without transferring static keys on the wire. When provisioning messages are sent, ASCON-128a is used for confidentiality and integrity; details of the mechanism are outside the evaluated threat model and are summarised here to support reproducibility [30,31,32].

3.2. Experimental Setup

All measurements were performed on hardware; the primary platform was a Xilinx Zynq-7020 (Snickerdoodle Black) configured via Vivado/Vitis 2020.2. The evaluated ciphers comprised ASCON-128a and the lightweight AEAD set (ACORN-128, TinyJAMBU-128, JAMBU–PRESENT-128) [10,22,23,24,25]; the Hybrid Security Framework (HSF) was exercised as a per-message keying wrapper around ASCON AEAD, while ACORN-128, TinyJAMBU-128, and JAMBU–PRESENT-128 were evaluated in their reference configurations. Secrets and selected code pages resided in two AXI-mapped BRAM regions inside partially reconfigurable partitions; Dynamic Function eXchange (driven through PCAP) periodically swapped and scrubbed these regions during tests [63,64]. Execution time was measured on the Zynq Processing System using the ARM Global Timer and PMU cycle counters [65]. Each packet emitted a single UART line of the form "[time] us_total|us_kdf|us_aead|us_glue, [pmu] cyc_total|cyc_kdf|cyc_aead, PKT;ver=1;…;", which was stored as a log. Here, us_glue covers non-cryptographic overhead (packet framing/parsing, UART I/O, buffer copies) outside KDF/AEAD; the PMU counters (cyc_*) correspond accordingly. Instantaneous current on the input rail was measured through a precision shunt and a Digilent Analog Discovery 3 in oscilloscope mode; energy per packet was obtained by integrating $P(t) = V_{in} I(t)$ over the software window aligned to the AEAD call. Where available, readings were cross-checked with a PicoTest M3511A 6½-digit multimeter, which measured the current on the main power line under steady-state load to validate the average current; per-packet energy was derived from the oscilloscope integration.

An Arduino Due acted as a receiver/attack harness: it parsed the Nonce and AAD fields (dev, idx, epoch, msg), derived $K_{i} = ASCON-XOF128("KDF|ASCON|v1" ∥ S_{ROOT} ∥ dev ∥ POOL[idx])[0..127]$, invoked ascon_aead128a_decrypt, and compared tags in constant time. Negative tests flipped tag bits and replayed stale $(epoch, msg)$ tuples to confirm anti-replay.

Correctness. Implementations were cross-checked against official test vectors for ASCON, ACORN, TinyJAMBU, and JAMBU; encryption and verification outputs matched bit-for-bit.

Two BRAM regions were used in the reported setup; using four or eight regions is straightforward and further enlarges the physical search space for scraping/probing, adding placement randomness without changing cryptographic entropy.

The Arduino Due was also used for the decryption and attack analyses. Energy consumption was analysed in the hardware environment: measurements on the FPGA and the Arduino Due evaluated the algorithm's energy efficiency in real-world scenarios. All components were configured to assess the algorithm's effectiveness and to measure its performance relative to traditional methods. Each tool focused on a specific performance metric or attack type, so that the algorithm's performance and security in both simulation and hardware environments could be examined in detail. The experimental setup also considers the practical implementation challenges in commercial embedded systems, addressing the academic-commercial misalignment highlighted in prior studies [8,21].

3.3. Hardware and Software Design

3.3.1. Platform & Top-Level Design

The security objective is two-fold: (i) to keep code and long-lived secrets out of easily scrapable, externally observable memories; and (ii) to make the physical locus of sensitive data a moving target through Dynamic Function eXchange (DFX). The prototype runs on a Zynq-7020 and reconfigures via PCAP; the method also ports to MicroBlaze, since the soft processor's instruction memory is kept in PL BRAMs that can likewise be moved via DFX. Using the hard-core ARM keeps control trusted, while two reconfigurable BRAMs in the PL store encrypted state and selected BRAM-resident code pages, both of which are relocated via DFX (see Figure 3). The framework was implemented on a Xilinx Zynq-7020 using Vivado/Vitis 2020.2. The current design does not use MicroBlaze; all software executes on the Zynq Processing System (dual-core Cortex-A9). The design comprises the Zynq PS, an AXI SmartConnect, and two partially reconfigurable modules (RMs) in the programmable logic (PL), bounded by Pblock floorplanning constraints. Each RM contains an AXI BRAM Controller and a Block Memory Generator instance.

3.3.2. Memory & Configuration Architecture

Each RM can be independently reconfigured at runtime using Dynamic Function eXchange (DFX). During normal operation one BRAM holds the encrypted secret POOL (slices) and selected code pages, while the second BRAM is prepared off-line. PCAP loads a partial bitstream and swaps the two BRAM roles. Constant BRAM relocation hides the key location and hinders side-channel/probing attacks. Figure 4 depicts the internal structure of each RM, which contains an AXI Block RAM (BRAM) Controller connected to a Block Memory Generator instance. The first BRAM occupies addresses 0x4000_0000–0x4000_7FFF and the second resides at 0x4200_0000–0x4200_7FFF, providing two 32 KiB on-chip memories that are alternately activated, destroyed, and recreated via DFX for secure key and word-list storage. The SmartConnect interconnect and addressing are conventional AXI; the novelty lies in continuously relocating the physical storage cells rather than only updating their contents (Figure 3 and Figure 4). On Zynq-7000 devices the PS programs the PL through PCAP, exposed to software by the DEVCFG (device configuration) peripheral; PCAP supports full and partial, encrypted bitstreams [63,66]. An ICAP primitive also exists in the PL for high-bandwidth in-fabric reconfiguration; using it from Zynq requires handing ownership from PCAP to ICAP and driving ICAP via custom logic or a DFX controller [66]. PCAP is therefore used for all DFX actions to keep the trusted control plane within the ARM PS. Table 4 summarises the main differences among these interfaces.

The memory types available on a Zynq-7020 are also compared for their suitability for cryptographic key storage. The design deliberately uses BRAM because it is on-chip, AXI-visible to both PS and PL, and DFX-relocatable. Non-volatile eFUSE or battery-backed BBRAM are used by Xilinx tools to store the 256-bit AES key for bitstream encryption, but their limited capacity and, for eFUSE, one-time-programmable nature make them unsuitable for dynamic key lists. Off-chip DDR and QSPI memories offer high capacity but can be probed through bus analysis and are therefore reserved for non-secret code storage. On-chip memory (OCM) is accessible only to the PS and cannot be relocated via DFX. Using four or eight reconfigurable BRAM regions is straightforward; increasing the number of regions enlarges the physical search space for scraping/probing without changing cryptographic entropy. For M BRAMs with K addresses each, the single-cell location uncertainty is $H_{loc} = \log_{2}(M K)$ [63,64].
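As a worked illustration of the location-uncertainty formula, the short Python sketch below evaluates $H_{loc}$ for two, four, and eight regions. It assumes that each 32 KiB BRAM is addressed as 32-bit words, giving K = 8192 addresses per region; the word width is an assumption made only for this example.

```python
import math


def location_entropy_bits(m_brams: int, k_addresses: int) -> float:
    """H_loc = log2(M * K): uncertainty about which cell holds a given word."""
    return math.log2(m_brams * k_addresses)


K = (32 * 1024) // 4   # 32 KiB region, 4-byte words (assumed) -> 8192 addresses

for m in (2, 4, 8):
    print(m, location_entropy_bits(m, K))
```

Under this assumption the two-region build gives 14 bits of location uncertainty, and each doubling of the region count adds one bit; this quantity describes physical placement only and is separate from the 128-bit key entropy.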

In Xilinx devices three configuration paths exist. PCAP links the PS to the configuration engine and is the sole full-config interface on Zynq-7000. PCAP accepts encrypted and authenticated bitstreams, supports both full and partial reconfiguration and is controlled by the devcfg peripheral; however, its throughput is limited by the AXI slave port and it ties up the PS during transfers. The Internal Configuration Access Port (ICAP) resides in the programmable logic and can be driven by custom hardware for high-bandwidth partial reconfiguration. ICAP cannot decrypt bitstreams—encrypted frames must be passed through PCAP first—and switching from PCAP to ICAP requires quiescing the configuration engine and clearing control bits in the DEVCFG register, as described in UG909 Vivado Partial Reconfiguration User Guide [66]. Finally, UltraScale and Versal devices introduce a Management Configuration Access Port (MCAP) that exposes the configuration bus through PCIe; MCAP is absent in Zynq-7000 and therefore irrelevant to the current implementation. PCAP is retained in this design because it integrates naturally with the ARM processing system and Vivado’s Dynamic Function eXchange flow [63,64].

For completeness, Table 5 extends the earlier memory comparison by enumerating all principal RAM and ROM resources in the Zynq-7020. The processing system contains separate 32 KiB instruction and data caches per Cortex-A9 core and a shared 512 KiB L2 cache; these caches are transparent to software and unsuitable for storing cryptographic keys. A 256 KiB on-chip memory (OCM) serves as tightly coupled RAM for fast code and data. Dual-channel DDR3 memories (PS DDR) provide several megabytes of volatile storage, whereas off-chip Quad SPI (QSPI) flash holds the non-volatile primary bitstream and user data. Within the programmable logic, distributed RAM exploits LUTs for small FIFOs and registers, BRAM offers 18/36 Kb blocks, and UltraRAM would provide larger 288 Kb blocks on devices that support it (not available on XC7Z020). The device also includes a 256-bit eFUSE array and a battery-backed RAM (BBRAM) for storing the AES decryption key used by the built-in bitstream decryption engine. Each of these memories has distinct access rights and volatility characteristics; only BRAM satisfies the combined requirements of on-chip storage, relocatability via DFX, and sufficient capacity for the secret pool.

On-chip BRAM is relocatable for security/performance, and PCAP lets the PS drive encrypted DFX on Zynq-7000 [63,66]. MCAP does not exist on this device class; ICAP would add PL logic and control complexity without clear security benefit in the present threat model. Hence, using PCAP instead of ICAP simplifies the software by allowing the PS to control reconfiguration directly; the devcfg driver abstracts the details of the configuration bitstream and ensures synchronisation with the running system. Finally, eFUSE/BBRAM are reserved and used for the device-unique AES key that protects the PL bitstream itself and are not appropriate stores for frequently changing secret-pools [67]. Per-rotation re-keying ensures each relocated list has fresh ciphertext, eliminating repeat-pattern leaks to prevent side-channel and scraping attacks. Thus, the chosen configuration maximises security without introducing external interfaces that could be probed (see Table 5 and Table 6).

3.3.3. Security, Run-Time & Implementation Details

The attacker model includes (i) delayering/probing of the package or board (bus snooping on DDR, QSPI, AXI), and (ii) invasive readout of on-chip arrays (Table 6). The design therefore follows four rules: (a) keep long-lived keys out of off-chip media; (b) place only encrypted secret pools and auxiliary state in PL BRAM; (c) ensure that any cleartext material lives only in PS core registers for a few cycles; and (d) continuously destroy and recreate the physical BRAM instance via DFX (Figure 4). This moving-target, encrypted-transport strategy raises the bar for both physical scraping and correlation/power analysis.

Reproducibility/Future work. To harden deployments against bus probing and power/EM analysis, an optional per-relocation re-encryption layer can be added. Before writing to the freshly loaded RM, a device-bound transport key is derived via ASCON-XOF128 from a long-term secret and a monotone rotation counter; POOL entries are then sealed under ASCON-128a (AEAD). Thus, even if two relocations carry the same logical contents, their ciphertexts on the AXI path and in BRAM become statistically independent across rotations. This mechanism was not enabled in the evaluated build; it is included here to facilitate reproduction and future comparative studies.

$K_{tr}^{(r)} = ASCON-XOF128("TR" ∥ S_{ROOT} ∥ dev_id ∥ epoch_{r} ∥ RM_id ∥ rot_ctr)[0..127]$ (1)

Rationale. Binding $K_{tr}^{(r)}$ to $(S_{ROOT}, dev_id)$ ties the transport key to a single device, while the monotone rot_ctr ensures uniqueness across relocations; the domain-separation tag "TR" prevents cross-use with other XOF invocations. Adding extra fields (e.g., epoch, RM identifiers, content digests) increases input size without material security gain under a correct rot_ctr policy and nonce-unique AEAD use, hence the compact form is preferred for clarity and portability.

On Zynq-7000, partial bitstreams are streamed through PCAP by the XDevCfg driver (header xdevcfg.h), whose typical call path includes XDcfg_CfgInitialize (binds the DEVCFG instance), XDcfg_Transfer (DMA push to PCAP), and status/ISR helpers (e.g., XDcfg_IsDmaBusy, XDcfg_IntrGetStatus) to synchronise reconfiguration. In contrast, the XilFPGA Vitis service (XFpga_∗ API, e.g., XFpga_PL_BitStream_Load) targets UltraScale(+)/ZynqMP/Versal platforms and is not the canonical path on Zynq-7000; the BSP uses XDevCfg with PCAP as recommended for 7-series based SoCs [63,66]. At boot, a secure configuration sequence loads the static PL through PCAP; decryption of the PL bitstream uses the AES key stored in eFUSE or BBRAM (device-bound) [67].

After AXI initialisation, the per-message keys are derived via ASCON-XOF128 from $(S_{ROOT}, dev, POOL[idx], epoch, msg)$, and ASCON-128a is used for authenticated encryption. Only ciphertext and tags, together with the public Nonce/AAD, transit AXI into the active BRAM. Following each message, the PS triggers a DFX event: a partial bitstream loads the alternate RM over PCAP (via XDcfg_Transfer), the newly loaded BRAM becomes active, and the old region is scrubbed. On the receiver (e.g., Arduino Due), the same (idx, epoch, msg, dev) regenerates the per-message key via ASCON-XOF128, and ASCON-128a verification/decryption completes the receive path, demonstrating cross-platform feasibility. In each rotation, only the encrypted payload and freshly re-encrypted secret-pool entries transit AXI into the active BRAM.

All crypto and key tasks run in the PS as Vitis C code, as follows. Upon system start, a secure boot sequence configures the static PL design via PCAP. The AES decryption key for the bitstream resides in eFUSE or BBRAM, but this key is unrelated to the dynamic keys used by the proposed method. The processor initialises the SmartConnect and AXI BRAM Controllers and loads an initial partial bitstream for RM1 or RM2 through PCAP. Dynamic keys are generated by concatenating S_ROOT, dev_id, and POOL[idx] and expanding the result with ASCON-XOF128 to produce a 128-bit key. The keys are stored in the active BRAM region as encrypted POOL entries and auxiliary state. A message is encrypted using ASCON-128a (AEAD) and transmitted. Before each DFX swap, a fresh transport key $K_{tr}^{(r)}$ is derived with ASCON-XOF128, the active secrets are re-encrypted in PS registers, and the new ciphertext set is written to the target RM's BRAM. Only then is XDcfg_Transfer() issued to load the partial bitstream and flip roles; registers are zeroized afterwards. After each encryption, the PS triggers a DFX event via XDevCfg, causing the inactive RM to be loaded with the next partial bitstream while the active one is scrubbed. This constant relocation of BRAM thwarts physical probing and correlation attacks. Decryption on the receiving side (e.g., Arduino Due) reverses the process by regenerating the same per-message key and completing verification/decryption with ASCON-128a.

MicroBlaze was previously described as being placed in the PL so that its naturally obfuscated instruction memory could be relocated via DFX using ICAP. Trusted control and DFX orchestration are now kept in the Zynq PS to (i) leverage PCAP and its driver stack, (ii) minimize PL resource overhead, and (iii) isolate long-lived secrets from off-chip memories. A MicroBlaze variant remains straightforward: the soft core would issue reconfiguration commands through an ICAP controller in the PL, store keys in BRAM or distributed RAM, and have its instruction/data BRAMs made reconfigurable and shuffled with the same DFX flow. The Zynq-based build is thus a didactic vehicle; the security argument (moving-target BRAM + encrypted transport + register-only cleartext lifetime) and the dynamic key-generation protocol carry over unchanged. MicroBlaze benefits from an obfuscated configuration memory that complicates ROM scraping; however, its use incurs significant resource overhead and restricts clock frequency. The Zynq-7020's hard ARM Cortex-A9 processing system provides higher performance, integrated peripherals, and a simpler software stack for dynamic partial reconfiguration via PCAP, and the design remains resistant to side-channel and scraping attacks because of the obfuscated structure of the proposed method. The method is platform-agnostic (nothing prevents a MicroBlaze implementation), and the choice of Zynq here merely illustrates the concept on an energy-aware SoC. Presenting the design on Zynq demonstrates that DFX-driven memory relocation and dynamic key generation can protect the communication and data that an attacker would need in order to infer the algorithm, even when the processor core and its caches are fully transparent. This also answers the question of why a soft core was not employed: MicroBlaze is suggested for extra resistance to ROM scraping, but the obfuscated structure of the proposed HSF already provides sufficient resistance to scraping and side-channel attacks, as the demonstration shows.

On Zynq-7000/Vitis 2020.2 the BSP exposes XDevCfg: initialize with XDcfg_CfgInitialize(), set up the PCAP for DMA, push the partial bitstream buffer with XDcfg_Transfer(), and poll/ISR-clear with XDcfg_IsDmaBusy() and XDcfg_IntrGetStatus(). On UltraScale(+)/ZynqMP platforms the analogous operation would use XilFPGA (XFpga_PL_BitStream_Load), but this service is not the canonical path on Zynq-7000 [66].

Software handles key generation and ASCON-128a authenticated encryption/decryption. Example C code in Vitis demonstrates this flow, and the results illustrate dynamic key derivation and message encryption. In the demonstrated Zynq implementation, all software runs on the hard ARM Cortex-A9 processing system (PS) of the Xilinx Zynq-7000 SoC. Instead of relying on a MicroBlaze soft core and obfuscated instruction memory in the PL, the current design derives security from the algorithm's obfuscated structure, keeps long-lived secrets out of off-chip memories, and stores only the encrypted secret pool and ephemeral session material in two AXI-mapped BRAM regions in the programmable logic (PL). After each transaction, Dynamic Function eXchange (DFX) via the Processor Configuration Access Port (PCAP) swaps the active BRAM region with an alternate one and scrubs the old instance, turning the physical locus of sensitive data into a moving target. Cleartext keys exist only transiently in PS core registers, which are practically unreachable via non-destructive probing. This DFX-driven moving-target memory and encrypted transport provide resistance against ROM scraping, bus probing, and reverse engineering, meeting the threat model without the resource overhead of a soft processor (PCAP transfers are issued via the XDevCfg driver on Zynq-7000). A MicroBlaze-based realization is a viable alternative when energy and latency constraints are relaxed, leveraging ICAP-driven DFX and PL-resident instruction BRAMs to add further obfuscation if desired. Otherwise, the Zynq PS solution remains robust against the mentioned attacks owing to the obfuscated algorithm structure of the proposed HSF.

This example highlights the generation of a unique per-message key, which provides a significant security advantage by isolating the potential impact of any compromised key. Arduino-based decryption verified the algorithm on a microcontroller: the Arduino Due acted as a cross-platform receiver that parses the Nonce and AAD, derives $K_{i}$ via ASCON-XOF128 from the same (dev, idx, epoch, msg), and verifies/decrypts with ASCON-128a. The Arduino decoder code demonstrates the applicability of the decryption process on the Arduino Due, and the code examples confirm that the algorithm operates effectively in both FPGA and microcontroller environments.

3.3.4. Manuscript Preparation and AI Tool Usage

During the preparation of this manuscript, the authors utilized generative AI tools (OpenAI's ChatGPT and Google's Gemini) for assistance with secretarial tasks. Their use was limited to improving readability, correcting grammar and syntax, and formatting references. No AI tool was used as a material or a method of the study, except for formatting. The authors assume full responsibility for all content, including the final verification of any AI-assisted outputs.

4. Results

This section reports hardware measurements on a Xilinx Zynq-7020 for encryption (producer side) and an Arduino Due for decryption/attack harness (solver side). The evaluated set comprises ASCON-128a, ACORN-128, TinyJAMBU-128, and JAMBU–PRESENT-128 in their reference forms, and the Hybrid Security Framework (HSF) applied as a per-message keying wrapper around ASCON-128a. Latency and cycle counts were captured on Zynq via the ARM Global Timer and PMU; current/voltage were measured on the supply rail as detailed below. The sub-stage timings and cycle breakdown that appear in Table 7 and Table 8 are taken directly from the PMU logs and consolidated tables (Zynq sub-stage cycles and μs: HSF–ASCON 12 μs total with 6.63 μs KDF and 5.26 μs AEAD; ASCON-128a 6 μs total with 5.52 μs AEAD; ACORN-128 500 μs; TinyJAMBU-128 172 μs; JAMBU–PRESENT-128 366 μs).

4.1. Measurement Setup and Discipline

A precision series shunt of $R_{s} = 18.408$ Ω was inserted on the 12 V input to the Zynq board. The Digilent Analog Discovery 3 sampled the shunt drop $V_{shunt}(t)$. Instantaneous current and load voltage follow

$I(t) = \frac{V_{shunt}(t)}{R_{s}}, \quad V_{load}(t) = 12.0 - V_{shunt}(t), \quad P(t) = I(t) V_{load}(t).$

Crypto windows operate in an observed $V_{load} \approx 8.5$–$9.5$ V span; window energies therefore use the per-window $V_{load}$, not a fixed value. A PicoTest M3511A multimeter provided an independent 12 V average-current cross-check ($P_{12} = 12 I_{12}$) used only for traceability in tables; per-window energies are reported from the shunt method. On the Arduino platform, power was treated as constant at 12 V × 1.8 mA = 21.6 mW, so energy scales linearly with duration using

$E(μWh) = 6 \times 10^{-6} \cdot duration(μs).$

A representative shunt capture illustrates three plateaus: (i) a low-current UART transmit window, (ii) an idle+crypto window enclosing KDF+AEAD execution, and (iii) a short over-current DFX window due to partial reconfiguration. "Delta" rows in the window table (Table 9) report signed differences relative to the stated baseline ($Δ V_{load} = - Δ V_{shunt}$) (Figure 5).
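The two energy computations above can be sketched in Python. The shunt resistance, the 12 V rail, and the 21.6 mW Arduino operating point come from the text; the trapezoidal integration rule and the function names are assumptions made for illustration, and the sample values in the usage note are not measured data.

```python
R_SHUNT = 18.408    # ohms, series shunt on the 12 V input
V_IN = 12.0         # volts
P_ARDUINO = 0.0216  # watts (12 V x 1.8 mA)


def window_energy_uwh(t_us, v_shunt):
    """Integrate P(t) = I(t) * V_load(t) over a window (trapezoidal rule, assumed).

    t_us: sample times in microseconds; v_shunt: shunt drops in volts.
    W * us gives uJ, and uJ / 3600 gives uWh.
    """
    p = [(v / R_SHUNT) * (V_IN - v) for v in v_shunt]
    energy_uj = 0.0
    for i in range(1, len(t_us)):
        energy_uj += 0.5 * (p[i] + p[i - 1]) * (t_us[i] - t_us[i - 1])
    return energy_uj / 3600.0


def arduino_energy_uwh(duration_us: float) -> float:
    """Constant-power model: E(uWh) = 6e-6 * duration(us)."""
    return P_ARDUINO * duration_us / 3600.0
```

For instance, `window_energy_uwh([0, 12], [3.0, 3.0])` integrates a hypothetical flat 3.0 V shunt drop (9.0 V at the load) over a 12 μs window. Inverting the linear Arduino relation, a per-operation figure of 0.00577 μWh corresponds to a duration of roughly 962 μs.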

The “AEAD delta (vs UART)” quantity in the subsequent window table isolates the incremental energy of a short encryption over the lower-current UART plateau, separating cryptographic work from logging overhead. The DFX window corresponds to the relocation step; its 4-RM single-shot duration/energy are measured directly, and the 2-RM single-shot values are obtained by linear scaling.

4.2. Producer-Side Encryption on Zynq-7020

Table 7 consolidates producer-side encryption results on the Zynq-7020, reporting PMU/GT-derived cycle counts and durations for the HSF–ASCON wrapper and the reference AEADs together with their sub-stages (KDF, init, aad, msg, fin). Power and energy are obtained at the crypto plateau using the shunt-derived load operating point and, for traceability, complementary 12 V PicoTest measurement aggregates; energies follow $E_{\mu Wh} = P \cdot \Delta t_{\mu s} / 3600$, with P in watts. The table thus quantifies the per-message computation cost of the producer under a uniform measurement discipline, with host-side glue excluded.

The producer table reports the cost of one protected message under HSF (KDF+AEAD) at the crypto operating point; the 12 V column shows PicoTest measurement aggregates for traceability. The stage and sub-stage durations/cycles originate from the PMU logs and consolidated tables.

4.3. Solver-Side Decryption on Arduino Due

Table 8 isolates solver-side decryption on the Arduino Due under the same per-message framing. Operation at a constant 12 V and 1.8 mA (21.6 mW) yields energies that scale linearly with duration; the HSF–ASCON total and its parse/KDF/AEAD breakdown are listed explicitly and are consistent with the consolidated figures. These values capture the microcontroller cost to parse inputs, derive per-message keys, and verify/decrypt.

The HSF–ASCON total, and its parse/KDF/AEAD breakdown, are computed at 21.6 mW and match the consolidated energy table (e.g., 0.00577 µWh/op for HSF–ASCON on Due).
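The constant-power energy model used for the Due can be checked with a few lines. This is a sketch of the arithmetic only; the helper name `energy_uwh` is an assumption, and the 962 μs duration is the HSF–ASCON solver-side total reported for the Due in this article.

```python
P_DUE_W = 12.0 * 1.8e-3  # 12 V x 1.8 mA = 0.0216 W (21.6 mW), from the text


def energy_uwh(power_w, duration_us):
    """E[uWh] = P[W] * dt[us] / 3600 (W*us = uJ, and 1 uWh = 3600 uJ)."""
    return power_w * duration_us / 3600.0


# The per-microsecond coefficient reproduces the 6e-6 factor quoted for the Due.
coefficient = P_DUE_W / 3600.0
hsf_ascon_due = energy_uwh(P_DUE_W, 962.0)
print(f"coefficient = {coefficient:.1e} uWh/us")
print(f"HSF-ASCON on Due = {hsf_ascon_due:.5f} uWh/op")
```

The same helper applies on the producer side when P is taken at the crypto plateau instead of the constant Due value.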

Security note. ASCON/ACORN/TinyJAMBU use 128-bit tags (AEAD); JAMBU–PRESENT uses a 64-bit tag; PRESENT-ECB is a non-AEAD baseline.

4.4. Energy Attribution and DFX Scaling

This subsection attributes energy to transport, cryptography, and relocation by combining the plateau segmentation in Table 9 with the per-algorithm summary in Table 10. The “AEAD delta (vs UART)” quantity isolates the incremental energy of a short encryption within the idle+crypto window over the lower-current UART plateau, separating cryptographic work from logging overhead, while the DFX rows correspond to single-shot partial reconfiguration. The measured 4-RM window provides the per-swap reference; a 2-RM single-shot scales approximately by one half in both duration and energy under identical operating conditions, and the summary table reports both variants explicitly. The DFX window exhibits a short over-current associated with partial reconfiguration; the table lists both total and delta-versus-idle contributions.

Per-algorithm summary (crypto-only vs. relocation). Cipher-to-cipher comparisons adopt the idle+crypto operating point for the producer: $P_{enc} = I_{enc}\, V_{load,enc}$ from the crypto plateau; per-cipher energy equals $E_{alg} = P_{enc}\, \Delta t_{alg}$ using the measured latency. HSF–ASCON is reported twice in the comparison table: (i) crypto-only (KDF+AEAD, no relocation) and (ii) crypto+DFX, where the relocation cost is made explicit.

Relocation policy scaling (4-RM measurement → 2-RM deployment). Stress experiments exercised four BRAM-backed reconfigurable modules (RMs) to probe relocation, whereas the target deployment uses two RMs. The DFX cost in Table 9 represents a per-swap window (one partial bitstream load). For a policy that performs $N_{swap}(M)$ swaps per message with M regions,

$$E_{DFX}(M) = N_{swap}(M)\, E_{DFX}^{per\text{-}swap}(M), \qquad t_{DFX}(M) = N_{swap}(M)\, t_{DFX}^{per\text{-}swap}(M).$$

Using the measured per-swap values from Table 9 for the 4-RM bitstream ($t_{DFX}^{per\text{-}swap}(4) = 180.8$ ms; delta energy $E_{DFX}^{per\text{-}swap}(4) = 11.75$ μWh; total window energy 81.016 μWh), a 2-RM policy with $N_{swap}(2) = 1$ (single-shot PR of a half-size bitstream) yields $t_{DFX}^{per\text{-}swap}(2) \approx 90.4$ ms and $E_{DFX}^{per\text{-}swap}(2) \approx 5.875$ μWh (delta), consistent with linear scaling of the measured per-swap window under identical operating conditions.
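The scaling rule can be written out as a short sketch. The function `dfx_cost` and its `ref_regions` parameter are illustrative names introduced here; the proportional scaling with the number of regions is the linear-scaling assumption stated in the text, not an independent measurement.

```python
T_SWAP_4RM_MS = 180.8     # measured 4-RM per-swap duration (Table 9)
E_DELTA_4RM_UWH = 11.75   # measured 4-RM per-swap delta energy (Table 9)


def dfx_cost(n_swaps, regions, ref_regions=4):
    """Return (duration_ms, delta_energy_uWh) for n_swaps single-shot swaps.

    Assumes the per-swap cost scales linearly with the number of regions
    relative to the measured 4-RM reference window.
    """
    scale = regions / ref_regions
    duration_ms = n_swaps * T_SWAP_4RM_MS * scale
    energy_uwh = n_swaps * E_DELTA_4RM_UWH * scale
    return duration_ms, energy_uwh


t_2rm, e_2rm = dfx_cost(n_swaps=1, regions=2)
print(f"2-RM single-shot: {t_2rm:.1f} ms, {e_2rm:.3f} uWh (delta)")
```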

The results consolidate per-message computation and energy costs across both platforms under a common framing. On the producer side (Zynq-7020), cycle counts and timings come from the ARM PMU/Global Timer, while energies are obtained by plateau-based series-shunt integration over the observed 8.5 to 9.5 V load range; shunt captures were acquired with a Digilent Analog Discovery 3, and 12 V average-current aggregates measured with a PicoTest M3511A Multimeter are used solely as a cross-check (Table 7). On the solver side (Arduino Due), operation at constant P = 21.6 mW makes energy proportional to duration, and the parse/KDF/AEAD breakdown is listed explicitly (Table 8). Window segmentation separates UART, idle+crypto, and DFX contributions and exposes the “AEAD delta (vs UART)” as the incremental encryption cost (Table 9). The comparison distinguishes crypto-only HSF–ASCON from HSF–ASCON+DFX and makes relocation overhead explicit; under single-shot partial reconfiguration, DFX scales linearly so a 2-RM policy incurs approximately half the duration and energy of the measured 4-RM window (Table 10).

DFX frequency policy. Relocation can be decoupled from every-packet operation: a policy of one single-shot 2-RM swap every N messages adds ≈90.4 ms and ΔE ≈ 5.875 μWh per swap (scaled linearly from the measured 4-RM window), leaving the crypto-only path unchanged.
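To make the effect of the swap interval N concrete, the following sketch averages the relocation cost over the messages between swaps. The averaging itself is an assumption made for illustration: in practice the swap is a separate blocking window rather than a per-message cost. The crypto-only figures (≈12 μs, ≈0.004589 μWh) and the 2-RM swap figures are those reported in this article; the function name is hypothetical.

```python
def amortised_per_message(n_messages_per_swap,
                          crypto_us=12.0, crypto_uwh=0.004589,
                          swap_ms=90.4, swap_uwh=5.875):
    """Average per-message latency (us) and energy (uWh) when one 2-RM swap
    is performed every n_messages_per_swap messages."""
    latency_us = crypto_us + swap_ms * 1000.0 / n_messages_per_swap
    energy_uwh = crypto_uwh + swap_uwh / n_messages_per_swap
    return latency_us, energy_uwh


for n in (1, 100, 10_000):
    latency, energy = amortised_per_message(n)
    print(f"N={n:>6}: {latency:10.2f} us, {energy:.6f} uWh per message")

avg_latency_us, avg_energy_uwh = amortised_per_message(1000)
```

Under this averaging, the relocation share falls inversely with N while the crypto-only term stays fixed, which is why the interval can be tuned to the threat model.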

4.5. Security Analysis and Attack Experiments

This section evaluates HSF–ASCON against brute-force, forgery/replay, side-channel, timing, hardware-tampering, dynamic key-recovery, and dictionary attacks. Session keys are derived via ASCON-XOF128 from the device-internal $S_{ROOT}$, $dev\_id$, and $POOL[idx]$; Policy B optionally appends $(epoch, msg)$ to achieve per-packet separation. Nonces are deterministic, $pack128(dev, epoch, msg)$, with uniqueness per key; acceptance is tag-only under ASCON-128a (128-bit tag). Under Policy A (per-slice), compromise of a derived key impacts packets sharing $(dev, idx)$ until rekey; under Policy B (per-message), compromise is confined to the affected packet. DFX-based relocation increases the physical search space for probing/template attacks but does not alter cryptographic key entropy.
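The two keying policies can be sketched as follows. This is an illustrative sketch, not the authors' implementation: SHAKE128 from the Python standard library stands in for ASCON-XOF128, and the domain label, the length-prefixed input encoding, and the 32/32/64-bit nonce field widths are assumptions introduced here, since the exact layouts are specified elsewhere in the article.

```python
import hashlib
import struct


def pack128(dev, epoch, msg):
    """Deterministic 128-bit nonce. ASSUMED layout: 32-bit dev | 32-bit epoch |
    64-bit message counter, big-endian."""
    return struct.pack(">IIQ", dev, epoch, msg)


def derive_key(s_root, dev, pool_slice, epoch=None, msg=None,
               dom=b"HSF-ASCON128a"):
    """Derive a 128-bit AEAD key from the root secret and public context.

    SHAKE128 is a stand-in for ASCON-XOF128; the `dom` label and the
    length-prefix encoding are hypothetical.
    """
    xof = hashlib.shake_128()
    for field in (dom, s_root, struct.pack(">I", dev), pool_slice):
        xof.update(struct.pack(">H", len(field)) + field)  # unambiguous framing
    if epoch is not None and msg is not None:              # Policy B only
        xof.update(struct.pack(">IQ", epoch, msg))
    return xof.digest(16)


s_root = bytes(range(32))   # placeholder root secret, for illustration only
pool_slice = bytes(16)      # placeholder POOL[idx] slice

key_a = derive_key(s_root, 7, pool_slice)                     # Policy A
key_b1 = derive_key(s_root, 7, pool_slice, epoch=1, msg=1)    # Policy B
key_b2 = derive_key(s_root, 7, pool_slice, epoch=1, msg=2)
nonce = pack128(7, 1, 1)
```

Under Policy A every packet that shares (dev, idx) receives `key_a`, whereas Policy B yields a different key for each (epoch, msg) pair, which is the compromise-scope difference described above.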

The method directly addresses threats such as ROM scraping and communication interception in embedded deployments. Bitstream encryption and memory obfuscation impede code extraction via physical or indirect means, while per-message keying localizes any exposure to a single ciphertext.

Side-channel considerations are twofold. First, software-based authenticated decryption (ASCON-128a) executes in a compact, fixed-control flow that limits timing variability; constant-frequency operation further reduces timing side channels. Second, dynamic partial reconfiguration (DFX) eliminates stable physical layouts by relocating BRAM-backed modules between messages, thereby degrading power/EM templates. Under a placement model with R relocation slots and M reconfigurable modules, the per-message configuration space is $P(R, M) = R! / (R - M)!$, yielding $H_{DFX} = \log_{2} P(R, M)$ bits. For a deployment with $R = 4$ slots and $M = 2$ modules, $H_{DFX} \approx \log_{2}(12) \approx 3.6$ bits per single-shot placement; a 4-RM stress configuration provides $\log_{2}(24) \approx 4.6$ bits. These bits complement, but do not replace, cryptographic key entropy; independence assumptions should be treated conservatively in security claims.

Correctness Note. DFX does not add to key entropy; it only enlarges the physical search space for scraping/probing. For the single “secret cell” model, $H_{loc} = \log_{2}(MK)$; with $M = 10$ BRAMs and $K = 1024$ addresses, $H_{loc} \approx \log_{2}(10240) \approx 13.3$ bits. If the real data are dispersed across k cells out of $MK$ cells and the remainder are filled with decoys, the combinatorial uncertainty is $H_{comb} = \log_{2} \binom{MK}{k}$ (e.g., $k = 10 \Rightarrow H_{comb} \approx 111.4$ bits; $k = 50 \Rightarrow H_{comb} \approx 451.7$ bits). The model that applies to a given deployment should be specified according to its requirements.
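The location-uncertainty figures quoted in the Correctness Note can be reproduced with the standard library. The function names are illustrative; the formulas are those given in the note.

```python
import math


def h_loc(m_brams, k_addresses):
    """Single secret-cell model: log2(M * K) bits."""
    return math.log2(m_brams * k_addresses)


def h_comb(m_brams, k_addresses, k_cells):
    """k real cells hidden among M*K cells with decoys: log2(C(M*K, k)) bits."""
    return math.log2(math.comb(m_brams * k_addresses, k_cells))


print(f"H_loc          = {h_loc(10, 1024):.1f} bits")
print(f"H_comb (k=10)  = {h_comb(10, 1024, 10):.1f} bits")
print(f"H_comb (k=50)  = {h_comb(10, 1024, 50):.1f} bits")
```

As the note states, these values describe physical search space only and are not to be added to the 128-bit key entropy.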

Hardware manipulation attacks on FPGA platforms (e.g., clock tampering or hardware Trojans) are countered using device-locked bitstreams and silicon identity. Vivado’s Device DNA and bitstream encryption confine bitstreams to the intended device, impeding unauthorized modification and reuse [8,37]. Timing attacks are further mitigated by a fixed clock source and constant-time AEAD verification paths [8,37]. Dynamic key-resolution attacks are addressed by per-message key derivation, which prevents key prediction across messages. High-variability inputs such as a random word or a timestamp are not used as KDF inputs in HSF; the derivation instead binds each key to the device-internal root secret, the device identifier, the pool slice and, optionally, the (epoch, msg) counters.

Assumptions on PR-channel integrity. Partial bitstreams are loaded over PCAP with device-bound encryption and integrity checks; toolchain versions and patch levels mitigating known issues (e.g., Starbleed on 7-series) are documented [40]. Threats requiring a compromised PR channel are out of scope.

Terminology (security accounting). q: total number of adversarial attempts/queries (verification or forgery tries). D: number of keys (e.g., devices/sessions); M: messages per key; $Q = D \cdot M$ (fleet-wide trials). RO/PRF: modeling the XOF/KDF as a Random Oracle or a PRF keyed by $S_{ROOT}$ over public context.

Quantitative AEAD bounds. Forgery probability for a t-bit tag under Q total attempts is bounded by $\Pr[\mathrm{forge}] \leq Q / 2^{t}$. In multi-user settings with D devices and M messages per device, $Q = D \cdot M$. Counter-based nonces avoid birthday collisions; if random L-bit nonces were used, $P_{coll} \approx Q^{2} / 2^{L + 1}$.

Multi-user. $D = 10^{4}$, $M = 10^{6}$ $\Rightarrow Q = 10^{10}$.

$$\begin{matrix} t = 64: & \Pr[\mathrm{forge}] \leq \frac{10^{10}}{2^{64}} \approx 5.4 \times 10^{-10} \approx 2^{-30.8} \\ t = 128: & \Pr[\mathrm{forge}] \leq \frac{10^{10}}{2^{128}} \approx 2.9 \times 10^{-29} \approx 2^{-94.8} \end{matrix}$$

Per-device. $D = 1$, $M = 10^{6}$ $\Rightarrow Q = 10^{6}$.

$$\begin{matrix} t = 64: & \Pr[\mathrm{forge}] \leq \frac{10^{6}}{2^{64}} \approx 5.4 \times 10^{-14} \approx 2^{-44.1} \\ t = 128: & \Pr[\mathrm{forge}] \leq \frac{10^{6}}{2^{128}} \approx 2.9 \times 10^{-33} \approx 2^{-108.1} \end{matrix}$$

Random-nonce reference (not used). $L = 128$, $Q = 10^{10}$ draws, $P_{coll} \approx \frac{10^{20}}{2^{129}} \approx 1.5 \times 10^{-19} \approx 2^{-62.6}$.
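The numerical bounds above follow directly from the two formulas and can be recomputed as a check. The function names are illustrative; the inputs are the fleet sizes and tag/nonce lengths stated in the text.

```python
import math


def forgery_bound(q_attempts, tag_bits):
    """Upper bound Pr[forge] <= Q / 2^t."""
    return q_attempts / 2 ** tag_bits


def nonce_collision(q_draws, nonce_bits):
    """Birthday approximation P_coll ~ Q^2 / 2^(L+1) for random L-bit nonces."""
    return q_draws ** 2 / 2 ** (nonce_bits + 1)


for label, q in (("multi-user", 10 ** 10), ("per-device", 10 ** 6)):
    for t in (64, 128):
        bound = forgery_bound(q, t)
        print(f"{label:10s} t={t:3d}: {bound:.1e} = 2^{math.log2(bound):.1f}")

p_coll = nonce_collision(10 ** 10, 128)
print(f"random-nonce reference: {p_coll:.1e} = 2^{math.log2(p_coll):.1f}")
```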

Data limit for 64-bit blocks. For PRESENT-based AEAD (JAMBU–PRESENT-128), the total number of processed 64-bit blocks under a single key should be kept $\ll 2^{32}$ due to the birthday bound; in practice, keys should be rotated at around $2^{28}$ to $2^{30}$ blocks as a safety margin. For a rotation point of $2^{29}$ total 64-bit blocks per key, the approximate maximum number of messages before rekey is listed in Table 11.

Cipher suite and fairness. JAMBU–PRESENT-128 emits 64-bit tags ($Q / 2^{64}$), whereas ASCON/ACORN/TinyJAMBU use 128-bit tags ($Q / 2^{128}$). Reported performance tables annotate this security-level difference. PRESENT-ECB is used solely as a core-speed baseline; practical deployments require an AEAD mode (e.g., JAMBU) (Table 12).

Side-channel methodology and PR-channel assumptions. Implementations follow constant-time coding with fixed control flow; table-based S-boxes are avoided where applicable (bit-slicing is also recommended for PRESENT), and tag comparison is constant-time. Dynamic Function eXchange relocates BRAM-resident pools and selected code pages, destabilizing persistent templates; relocation is orthogonal to cryptographic entropy but increases the attacker’s physical search space. PR-channel integrity is assumed: device-bound, encrypted bitstreams with verified tooling mitigate unauthorized reuse or tampering. Future work includes DPA/CPA measurements (fixed frequency), leakage coefficients, and template robustness.
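The constant-time tag comparison mentioned above can be illustrated as follows. This is a sketch of the idea, not the firmware used on the Due: the helper name `ct_equal` is hypothetical, and an interpreted language gives no hard timing guarantee, so on a microcontroller the same XOR-accumulate loop would be written in C. The standard-library call `hmac.compare_digest` is shown as the idiomatic equivalent.

```python
import hmac


def ct_equal(expected_tag, received_tag):
    """Compare two tags without an early exit on the first mismatching byte."""
    if len(expected_tag) != len(received_tag):
        return False
    diff = 0
    for x, y in zip(expected_tag, received_tag):
        diff |= x ^ y          # accumulate differences over every byte
    return diff == 0


good = bytes(range(16))
bad = bytes(range(15)) + b"\xff"

print(ct_equal(good, good), ct_equal(good, bad))
print(hmac.compare_digest(good, good), hmac.compare_digest(good, bad))
```

Every byte is visited regardless of where the first difference occurs, so the running time does not reveal how many leading tag bytes were guessed correctly.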

Attack experiments (Arduino Due, 84 MHz). Practical brute-force throughput and leakage isolation were evaluated on an Arduino Due (SAM3X8E, 84 MHz). ASCON-128a verification cost from logs is ≈224 μs (≈18,816 cycles, consistent with $F_{CPU} = 84$ MHz, cycles ≈ μs × 84); the brute-force speed trial measures tag-verification throughput (integrity checks per second) rather than cryptanalytic success (Table 13). Throughput trials enumerate random 128-bit keys and count verification attempts/s; leakage trials contrast (i) Plain ASCON with a single session key K assumed leaked versus (ii) HSF–ASCON with a per-message key $K_{i}$ assumed leaked (Table 14). Measurements were sanity-checked against the power model reported in the Results section (constant P = 21.6 mW on Due); no energy anomalies were observed.

Per-attack notes (Table 15). Brute force: Per-message 128-bit key and 128-bit tag; at the measured $\approx 4.4 \times 10^{3}$/s on Due, the search space remains unreachable (ETA(50%) $\approx 1.2 \times 10^{27}$ years). Side-channel: Fixed frequency, fixed control flow, and DFX-induced relocation destabilise power/EM templates; persistent profiling is impeded. Timing: The verification path is constant-time with a fixed clock; data-dependent branching/min-time leakage is reduced. Hardware manipulation (HTH): Device-locked, encrypted bitstreams (Device DNA binding) hinder unauthorised reuse; sustained placement manipulation requires a compromised PR channel. Dynamic key resolution: $K_{i} = XOF(S_{ROOT}, dev, POOL[idx] [, epoch, msg])$; leakage of a single $K_{i}$ affects only its packet; cross-message prediction is ineffective. Dictionary: No human-word or low-entropy dependency; offline guessing reduces to brute force over the 128-bit AEAD key space.
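The brute-force estimate and the cycle conversion quoted for the Due can be reproduced with a short calculation. The helper names are illustrative, and the ETA assumes, as the text does, a uniform 128-bit key space searched at the measured verification rate on a single device.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def eta_half_years(key_bits, attempts_per_second):
    """Expected time to cover half of a uniform key space, in years."""
    return 2 ** (key_bits - 1) / attempts_per_second / SECONDS_PER_YEAR


def us_to_cycles(duration_us, f_cpu_mhz=84):
    """cycles ~ microseconds x clock frequency in MHz."""
    return duration_us * f_cpu_mhz


eta = eta_half_years(128, 4.4e3)
cycles = us_to_cycles(224)
print(f"ETA(50%) = {eta:.2e} years")
print(f"224 us at 84 MHz = {cycles} cycles")
```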

Relocation randomness. Under slot-level relocation with R available slots and M reconfigurable modules per message, the number of single-shot placements is $P(R, M) = \frac{R!}{(R - M)!}$, giving $H_{DFX} = \log_{2} P(R, M)$ bits of placement uncertainty. With $M = 2$, this yields $H_{DFX} = 1.0$ bit for $R = 2$, 3.6 bits for $R = 4$ ($P = 12$), and 6.5 bits for $R = 10$ ($P = 90$). These bits are orthogonal to the 128-bit keyspace and should not be summed; they raise the work factor for scraping/probing and power/EM template reuse.
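The placement counts and the corresponding uncertainty values can be verified with the standard library. The function names are illustrative; `math.perm(R, M)` computes exactly R!/(R − M)!.

```python
import math


def placements(r_slots, m_modules):
    """Number of ordered single-shot placements, P(R, M) = R! / (R - M)!."""
    return math.perm(r_slots, m_modules)


def h_dfx(r_slots, m_modules):
    """Placement uncertainty in bits, log2 P(R, M)."""
    return math.log2(placements(r_slots, m_modules))


for r in (2, 4, 10):
    print(f"R={r:2d}, M=2: P={placements(r, 2):3d}, H_DFX={h_dfx(r, 2):.1f} bits")
print(f"R= 4, M=4: P={placements(4, 4):3d}, H_DFX={h_dfx(4, 4):.1f} bits")
```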

One-line summary. On Arduino Due, ASCON-128a verification is ≈226.5 μs per packet; scanning 10,000 packets takes ≈2.27 s. With a leaked session key K, the plain design yields 10,000/10,000 OK, whereas with a leaked per-message key $K_{i}$ under HSF the success is 1/10,000; brute-force throughput is $\approx 4.4 \times 10^{3}\ \mathrm{s}^{-1}$, so the 128-bit space is practically unreachable (ETA $\approx 10^{27}$ years). This empirical picture is consistent with formal indistinguishability and per-message keying assumptions validated in CryptoVerif [68].
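The logic of the leakage-isolation trial can be mimicked in a small simulation. This sketch is not the harness run on the Due: a truncated HMAC-SHA256 tag stands in for ASCON-128a verification, SHAKE128 stands in for the ASCON-XOF-based KDF, and the root secret, payloads, and function names are placeholders introduced here. It only illustrates why one leaked key opens every packet in the plain design but a single packet under per-message keying.

```python
import hashlib
import hmac

N_PACKETS = 10_000
S_ROOT = bytes(32)  # placeholder root secret, for illustration only


def make_tag(key, nonce, payload):
    """Stand-in for AEAD tag generation (HMAC-SHA256 truncated to 128 bits)."""
    return hmac.new(key, nonce + payload, hashlib.sha256).digest()[:16]


def per_message_key(msg_index):
    """Stand-in per-message KDF (SHAKE128 instead of ASCON-XOF)."""
    return hashlib.shake_128(S_ROOT + msg_index.to_bytes(8, "big")).digest(16)


def count_opened(packets, leaked_key):
    """Count packets whose tag verifies under the leaked key."""
    return sum(
        hmac.compare_digest(make_tag(leaked_key, nonce, payload), tag)
        for nonce, payload, tag in packets
    )


session_key = hashlib.shake_128(b"session" + S_ROOT).digest(16)
plain_packets, hsf_packets = [], []
for i in range(N_PACKETS):
    nonce = i.to_bytes(16, "big")
    payload = b"payload-%d" % i
    plain_packets.append((nonce, payload, make_tag(session_key, nonce, payload)))
    hsf_packets.append((nonce, payload, make_tag(per_message_key(i), nonce, payload)))

opened_plain = count_opened(plain_packets, session_key)
opened_hsf = count_opened(hsf_packets, per_message_key(0))
print(f"plain design: {opened_plain}/{N_PACKETS} packets verify")
print(f"per-message keys: {opened_hsf}/{N_PACKETS} packets verify")
```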

4.6. Summary of Results

Measurements quantify the producer (Zynq-7020) and solver (Arduino Due) costs under a common per-message framing. On the producer side, PMU/Global-Timer timings and plateau-based shunt integration at the observed 8.5 to 9.5 V load range yield per-stage energies and durations (Table 7); HSF–ASCON completes the crypto-only path in ≈12 μs (KDF + AEAD), while ASCON-128a alone completes in ≈6 μs. The comparison in Table 10 shows that, at a 1×16 B payload, energy ordering follows latency: ACORN-128 ≫ JAMBU–PRESENT-128 ≳ TinyJAMBU-128 ≫ ASCON-128a, with HSF–ASCON incurring a small KDF overhead yet remaining in the microsecond/μWh regime. Dynamic partial reconfiguration appears as a separate, single-shot relocation window: the measured 4-RM swap costs ≈180.8 ms and $E_{\Delta} \approx 11.75$ μWh (absolute ≈81.016 μWh), while a 2-RM single-shot scales to ≈90.4 ms and $E_{\Delta} \approx 5.875$ μWh under identical operating conditions (Table 9). On the solver side, operation at constant P = 21.6 mW gives HSF–ASCON verification/decryption of ≈962 μs, with sub-stage energies proportional to duration (Table 8). Security evaluations indicate that per-message 128-bit keys render exhaustive search impractical [69]; the brute-force throughput measured on Arduino Due is $\approx 4.4 \times 10^{3}\ \mathrm{s}^{-1}$ (Table 13), implying ETA(50%) $\approx 10^{27}$ years for a uniform 128-bit space. Leakage experiments demonstrate damage isolation: a single leaked session key K opens all packets for a plain design, whereas a single leaked per-message key $K_{i}$ under HSF opens only its packet (Table 14). Side-channel resistance benefits from constant-frequency operation and compact, fixed-control AEAD verification, together with DFX-based relocation that removes stable physical layouts; placement uncertainty grows with the number of relocation slots R (e.g., $H_{DFX} \approx 1.0, 3.6, 6.5$ bits for $R = \{2, 4, 10\}$ at $M = 2$), which raises the work factor for scraping/probing without altering key entropy. Bitstream encryption bound to silicon identity (Device DNA) counters unauthorized bitstream reuse and supports robustness against hardware manipulation [8,37]. The empirical picture aligns with per-message keying and indistinguishability assumptions established in formal analyses [68]. The attack matrix in Table 15 aligns with the measured brute-force throughput and leakage-isolation trials, indicating high resilience under the stated threat model.

Security contributions of the method. (1) Multi-dimensional domain separation. Keys are separated across algorithm, device, slice, and (optionally) message via $dom$ and $(epoch, msg)$. (2) Per-message keying (optional). Including $(epoch, msg)$ in the KDF yields a distinct key per packet, localizing exposure and simplifying misuse resistance. (3) Operational limits quantified. Fleet-wide $Q / 2^{t}$ bounds, nonce policy, and 64-bit block data limits are stated alongside performance results, enabling risk-aware deployment. (4) Honest reporting across schemes. Tag-size differences (64 versus 128 bits) are flagged in tables to align performance with security level.

5. Discussion

The evaluation indicates that a per-message AEAD with lightweight key derivation (HSF–ASCON) achieves microsecond-scale latency and micro-watt-hour energy while confining any disclosure to a single ciphertext. On the producer (Zynq-7020), crypto-only HSF–ASCON completes in ≈12 μs with E ≈ 0.004589 μWh for a 1 × 16 B payload; plain ASCON-128a completes in ≈6 μs with E ≈ 0.002294 μWh (Table 7). On the solver (Arduino Due), verification/decryption costs scale linearly with time at a constant P = 21.6 mW (Table 8). Dynamic Function eXchange (DFX) appears as a separate relocation window: the measured 4-RM single-shot costs ≈180.8 ms with ΔE ≈ 11.75 μWh; a 2-RM single-shot scales to ≈90.4 ms and ΔE ≈ 5.875 μWh under identical operating conditions (Table 9). These windows can be scheduled sparsely relative to traffic, allowing relocation frequency to be tuned to the threat model. A concise head-to-head view is given in Table 16, contrasting baseline ASCON-128a with HSF–ASCON (crypto-only) and HSF–ASCON with a single-shot 2-RM DFX window.

Security posture. Per-message 128-bit keys keep exhaustive search impractical [69]; the measured brute-force throughput on Due ($\approx 4.4 \times 10^{3}$/s) implies ETA(50%) $\approx 10^{27}$ years for a uniform 128-bit space (Table 13). Leakage experiments validate damage isolation: a leaked session key in a plain design compromises all packets in that session, whereas a leaked per-message key $K_{i}$ under HSF affects only its packet (Table 14). Device-locked, encrypted bitstreams and silicon identity (Device DNA) restrict unauthorized reuse and help counter hardware manipulation [8,37]. Constant-frequency operation and compact, fixed-control AEAD verification reduce timing variance; DFX relocates BRAM-resident secrets and selected code pages, destabilizing persistent power/EM templates without claiming extra cryptographic entropy. This empirical picture aligns with indistinguishability/per-message-keying assumptions supported by formal analyses [68] and complements prior DFX-based defences such as algorithm hopping and energy-adaptive HSMs [14,43].

Trade-offs and deployability. HSF–ASCON adds a small XOF-KDF cost relative to plain ASCON but remains in the same latency/energy regime for short packets; relocation overhead is external to the crypto path and can be amortized or scheduled. The approach retains compatibility with other lightweight AEADs (e.g., ACORN, TinyJAMBU, JAMBU–PRESENT) while providing per-message keying and relocation as orthogonal hardening layers. Compared with static-layout or static-key baselines, the method improves the compromise scope (message-granular) at modest computational overhead and a tunable relocation cost, addressing the commercial–academic gap noted in prior work [8,21].

Comparison against AEAD baselines. Table 17 summarises crypto-only latency/energy on Zynq-7020 together with the effective compromise scope under the evaluated configurations; relocation overhead is reported separately in Table 9.

Synthesis with state of the art. Table 18 contrasts representative approaches. The HSF row has been updated with the measured crypto-only figures and the single-shot DFX cost under the 2-RM policy (half of the 4-RM window in Table 9).

Related literature note. Prior work emphasises the tension between energy budgets and key-management in embedded/IoT settings. Lightweight primitives and PRESENT-class designs target low area/energy but offer limited key agility in their basic forms [70,71]. Practical constant-time coding and leakage-reduction techniques are recommended to limit timing/power side channels on general-purpose cores [72,73]. One-time-pad/XOR-style schemes avoid computation but are impractical due to key distribution and reuse hazards [74,75]. These observations motivate the use of a small-cost XOF-based KDF to derive per-message AEAD keys under tight energy constraints.

Limitations and outlook. Relocation frequency presents a tunable cost–benefit: higher frequency increases template instability at the expense of additional DFX windows; lower frequency amortizes the cost. The current harness evaluates short messages; larger payloads favor AEADs with higher per-byte throughput, while the HSF cost remains dominated by a fixed KDF+AEAD setup. Strengthening of the reconfiguration channel (e.g., authenticated PR control) remains essential [8,37]. Within these bounds, per-message AEAD with relocatable BRAM provides a practical, energy-aware defence layer for embedded deployments while preserving compatibility with established lightweight primitives.

6. Conclusions

A hybrid protection layer that couples per-message authenticated encryption with lightweight key derivation (HSF–ASCON) and Dynamic Function eXchange (DFX)–assisted relocation of BRAM-resident secrets and selected code pages has been demonstrated for embedded/IoT targets. The cryptographic core adheres to the AEAD interface (key, nonce, AAD → ciphertext, tag) [9,76,77], and derives a fresh per-message key via a sponge-based XOF with domain separation [18,19,20]. ASCON-128a and ASCON-XOF are selected in line with the NIST Lightweight Cryptography process and the algorithm’s published analysis [10,11]. The XOF-KDF construction follows standard practice that permits public context (e.g., device identifier, index) alongside a secret root; this is consistent with HKDF and NIST KDF recommendations on salt/info inputs [30,31,32]. Nonce uniqueness is enforced by a 128-bit counter layout, reflecting widely adopted guidance for AEAD modes [62]. These choices align with Kerckhoffs’s principle: security is anchored in key secrecy rather than algorithm secrecy [33].

Measurements indicate that the method attains microsecond-scale latency and micro-watt-hour energy while confining any disclosure to a single ciphertext. On the producer (Zynq-7020), crypto-only HSF–ASCON completes in ≈12 μs with E ≈ 0.004589 μWh for a 1 × 16 B payload, whereas plain ASCON-128a completes in ≈6 μs with E ≈ 0.002294 μWh (Table 7 and Table 10); timings are obtained via the ARM Global Timer/PMU on the Cortex-A9 as per vendor documentation [63,65]. On the solver (Arduino Due), verification/decryption costs scale linearly at a constant P = 21.6 mW and measure ≈962 μs (Table 8). DFX appears as a separate relocation window: the measured 4-RM single-shot costs ≈180.8 ms with ΔE ≈ 11.75 μWh, and the 2-RM single-shot, obtained by linear scaling of that window, costs ≈90.4 ms with ΔE ≈ 5.875 μWh under identical operating conditions (Table 9). A head-to-head summary contrasting baseline ASCON-128a, HSF–ASCON (crypto-only), and HSF–ASCON including a 2-RM single-shot DFX window is provided in Table 16. Additional crypto-only comparisons with ACORN-128, TinyJAMBU-128, and JAMBU–PRESENT-128 are given in Table 17 (specs. in [22,23,24,25]).

Security evaluations support the intended threat model. Per-message 128-bit keys render exhaustive search impractical [69]; the measured verification throughput on Due ($\approx 4.4 \times 10^{3}$/s) implies an infeasible ETA(50%) $\approx 10^{27}$ years for a uniform 128-bit space (Table 13). Leakage experiments confirm damage isolation: a leaked long-lived session key compromises all packets in a plain baseline, whereas a leaked per-message key affects only its packet under HSF (Table 14). Constant-time verification and fixed-frequency operation reduce timing leakage; DFX relocates BRAM-resident secrets and selected code pages to destabilize power/EM templates without claiming extra cryptographic entropy. Device-locked, encrypted bitstreams and silicon identity constrain unauthorized reuse and emphasize the need to secure the partial-reconfiguration channel against hardware manipulation [8,37]. These observations are consistent with accepted AEAD security notions and engineering practice [9,62,76,77].

The approach integrates with established lightweight AEADs while adding orthogonal hardening. On the evaluated setup, HSF–ASCON remains in the same latency/energy regime as ASCON-128a for short payloads and improves compromise scope to the message granularity; baseline figures for ACORN-128, TinyJAMBU-128, and JAMBU–PRESENT-128 are included for context (Table 10 and Table 17) [10,11,22,23,24,25]. DFX scheduling can be tuned sparsely relative to traffic, making the relocation cost adjustable to the threat model. In line with prior DFX-based defences (algorithm hopping, energy-adaptive modules, dynamic obfuscation) [7,8,14,43,45], the present design emphasizes authenticated encryption and per-message keys as the primary cryptographic control.

Several caveats guide deployment and future work. The DFX window is an explicit cost and assumes an authenticated, rate-limited reconfiguration path [37,64]. For larger payloads, per-byte throughput of the chosen AEAD dominates whereas the HSF overhead remains largely fixed; parameterization of pool size and index policy therefore merits workload-aware tuning. Side-channel hardening beyond constant-time code (e.g., power balancing, masking) can further reduce leakage [72,73]. If a PUF is used to derive the long-term root secret, careful, non-interactive provisioning is advised given modelling attacks on certain strong-PUF families [38,78,79]. Finally, per-relocation re-encryption of BRAM contents (outlined in Equation (1)) offers a practical avenue to decorrelate ciphertexts across swaps without altering the online protocol.

In summary, per-message AEAD with XOF-based key derivation, combined with DFX-assisted relocation of BRAM-resident secrets and code pages, delivers a practical and energy-aware defence layer for embedded deployments [10,11]. The method aligns with standardized AEAD/KDF practice [9,20,30,31,32] and with the NIST-endorsed lightweight cryptography portfolio [10,11], while providing tunable physical-layout churn and message-granular damage containment under the measured latency and energy budgets.

Author Contributions

C.B.K. concept; curation; analysis; investigation; methodology; software; resources; draft; visualisation; funding. H.Ş.B. concept; validation; investigation; methodology; review, edit; supervision; administration. F.Y. concept; validation; supervision; methodology; review, edit. All authors have read and agreed to the published version of this manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data supporting the findings are available in the article and repository.

Acknowledgments

The authors would like to acknowledge the use of OpenAI’s ChatGPT or Google’s Gemini for its assistance with language editing and formatting.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Proulx, A.; Chouinard, J.Y.; Fortier, P.; Miled, A. A Survey on FPGA Cybersecurity Design Strategies. ACM Trans. Reconfigurable Technol. Syst. 2023, 16, 1–33.
Wanderley, E.; Vaslin, R.; Crenne, J.; Cotret, P.; Gogniat, G.; Diguet, J.P.; Danger, J.L.; Maurine, P.; Fischer, V.; Badrignans, B.; et al. Security FPGA Analysis. In Security Trends for FPGAS: From Secured to Secure Reconfigurable Systems; Springer: Dordrecht, The Netherlands, 2011; pp. 7–46.
Faraj, M.; Gebotys, C. Quiescent photonics side channel analysis: Low cost SRAM readout attack. Cryptogr. Commun. 2021, 13, 363–376.
Azriel, L.; Speith, J.; Albartus, N.; Ginosar, R.; Mendelson, A.; Paar, C. A survey of algorithmic methods in IC reverse engineering. J. Cryptogr. Eng. 2021, 11, 299–315.
Guin, U.; Huang, K.; DiMase, D.; Carulli, J.M., Jr.; Tehranipoor, M.; Makris, Y. Counterfeit integrated circuits: A rising threat in the global semiconductor supply chain. Proc. IEEE 2014, 102, 1207–1228.
Abideen, Z.U.; Gokulanathan, S.; Aljafar, M.J.; Pagliarini, S. An overview of FPGA-inspired obfuscation techniques. ACM Comput. Surv. 2024, 56, 299.
Karam, R.; Hoque, T.; Ray, S.; Tehranipoor, M.; Bhunia, S. Robust bitstream protection in FPGA-based systems through low-overhead obfuscation. In Proceedings of the 2016 International Conference on ReConFigurable Computing and FPGAs (ReConFig), Cancún, Mexico, 30 November–2 December 2016; pp. 1–8.
Stolz, F.; Albartus, N.; Speith, J.; Klix, S.; Nasenberg, C.; Gula, A.; Fyrbiak, M.; Paar, C.; Güneysu, T.; Tessier, R. LifeLine for FPGA protection: Obfuscated cryptography for real-world security. IACR Trans. Cryptogr. Hardw. Embed. Syst. 2021, 2021, 412–446.
McGrew, D. An Interface and Algorithms for Authenticated Encryption (RFC 5116); Technical Report 5116; RFC Editor. 2008. Available online: https://www.rfc-editor.org/info/rfc5116 (accessed on 15 September 2025).
Dobraunig, C.; Eichlseder, M.; Mendel, F.; Schläffer, M. Ascon v1.2: Lightweight authenticated encryption and hashing. J. Cryptol. 2021, 34, 33.
Turan, M.S.; McKay, K.; Chang, D.; Bassham, L.E.; Kang, J.; Waller, N.D.; Kelsey, J.M. Status Report on the Final Round of the NIST Lightweight Cryptography Standardization Process (NIST IR 8454); Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2023.
Silva, C.; Cunha, V.A.; Barraca, J.P.; Aguiar, R.L. Analysis of the Cryptographic Algorithms in IoT Communications. Inf. Syst. Front. 2024, 26, 1243–1260.
Engels, S.; Hoffmann, M.; Paar, C. A critical view on logic locking security. J. Cryptogr. Eng. 2022, 12, 229–244.
Soliman, S.; Jaela, M.A.; Abotaleb, A.M.; Hassan, Y.; Abdelghany, M.A.; Abdel-Hamid, A.T.; Salama, K.N.; Mostafa, H. FPGA implementation of dynamically reconfigurable IoT security module using algorithm hopping. Integration 2019, 68, 108–121.
Song, K.; Zhu, Z.; Yang, H.; Ni, T.; Xu, W. MobileKey: A fast and robust key generation system for mobile devices. In Proceedings of the Adjunct Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers, Atlanta, GA, USA and Cambridge, UK, 11–15 September 2022; pp. 427–431.
Xiao, Q.; Zhao, J.; Feng, S.; Li, G.; Hu, A. Securing NextG networks with physical-layer key generation: A survey. Secur. Saf. 2024, 3, 2023021.
Xia, E.; Hu, B.J.; Shen, Q. A survey of physical layer secret key generation enhanced by intelligent reflecting surface. Electronics 2024, 13, 258.
Bertoni, G.; Daemen, J.; Peeters, M.; Van Assche, G. Sponge functions. In Proceedings of the ECRYPT Hash Workshop 2007, Barcelona, Spain, 24–25 May 2007; Available online: https://keccak.team/files/SpongeFunctions.pdf (accessed on 15 September 2025).
Bertoni, G.; Daemen, J.; Peeters, M.; Van Assche, G. Duplexing the sponge: Single-pass authenticated encryption and other applications. In Selected Areas in Cryptography (SAC 2011); Springer: Berlin/Heidelberg, Germany, 2011; pp. 320–337.
National Institute of Standards and Technology. SHA-3 Standard: Permutation-Based Hash and Extendable-Output Functions (FIPS 202); U.S. Department of Commerce, NIST: Gaithersburg, MD, USA, 2015. Available online: https://nvlpubs.nist.gov/nistpubs/FIPS/NIST.FIPS.202.pdf (accessed on 15 September 2025).
Vipin, K.; Fahmy, S.A. FPGA dynamic and partial reconfiguration: A survey of architectures, methods, and applications. ACM Comput. Surv. 2018, 51, 1–39.
Wu, H. ACORN v3: Lightweight Authenticated Cipher; CAESAR Competition Submission (Round 3). 2016. Available online: https://competitions.cr.yp.to/round3/acornv3.pdf (accessed on 15 September 2025).
Wu, H.; Huang, T. TinyJAMBU: A Family of Lightweight Authenticated Encryption Algorithms; NIST LWC Submission (Round 1). 2019. Available online: https://csrc.nist.gov/CSRC/media/Projects/Lightweight-Cryptography/documents/round-1/spec-doc/TinyJAMBU-spec.pdf (accessed on 15 September 2025).
Wu, H.; Huang, T. JAMBU: A Lightweight Authenticated Encryption Mode (v2.1); CAESAR Competition Submission (Round 3). 2016. Available online: https://competitions.cr.yp.to/round3/jambuv21.pdf (accessed on 15 September 2025).
Bogdanov, A.; Knudsen, L.R.; Leander, G.; Paar, C.; Poschmann, A.; Robshaw, M.; Seurin, Y.; Vikkelsoe, C. PRESENT: An ultra-lightweight block cipher. In Cryptographic Hardware and Embedded Systems—CHES 2007; Springer: Berlin/Heidelberg, Germany, 2007; pp. 450–466. [Google Scholar] [CrossRef]
Biryukov, A.; Shamir, A.; Wagner, D. Real time cryptanalysis of A5/1 on a PC. In Proceedings of the Fast Software Encryption: 7th International Workshop, FSE 2000, New York, NY, USA, 10–12 April 2001; pp. 1–18. [Google Scholar] [CrossRef]
Raghunath, B.H.; Aravind, H.S. An Efficient FPGA-Based Dynamic Partial Reconfigurable Implementation. Int. J. Intell. Syst. Appl. Eng. 2023, 11, 183–192. Available online: https://ijisae.org/index.php/IJISAE/article/view/2471 (accessed on 15 September 2025).
Al-Haija, Q.A.; Enshasy, H.; Smadi, A. Estimating energy consumption of diffie hellman encrypted key exchange (DH-EKE) for wireless sensor network. In Proceedings of the 2017 IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing (INCOS), Srivilliputtur, India, 23–25 March 2017; pp. 1–6. [Google Scholar] [CrossRef]
Shannon, C.E. Communication theory of secrecy systems. Bell Syst. Tech. J. 1949, 28, 656–715. Available online: https://www.cs.miami.edu/home/burt/learning/csc685.211/bstj28-4-656.pdf (accessed on 15 September 2025). [CrossRef]
Krawczyk, H.; Eronen, P. HMAC-Based Extract-and-Expand Key Derivation Function (HKDF) (RFC 5869); Technical Report 5869; Internet Engineering Task Force. 2010. Available online: https://www.rfc-editor.org/info/rfc5869 (accessed on 15 September 2025).
Barker, E.; Chen, L.; Davis, R. Recommendation for Key Derivation Through Extraction-then-Expansion (NIST Special Publication 800-56C); Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2011. [CrossRef]
Chen, L. Recommendation for Key Derivation Using Pseudorandom Functions (NIST SP 800-108); Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2009. [CrossRef]
Kerckhoffs, A. La cryptographie militaire. J. Sci. Mil. 1883, 9, 5–38, 161–191. Available online: https://www.petitcolas.net/kerckhoffs/crypto_militaire_1.pdf (accessed on 15 September 2025).
Moradi, A.; Barenghi, A.; Kasper, T.; Paar, C. On the vulnerability of FPGA bitstream encryption against power analysis attacks: Extracting keys from xilinx Virtex-II FPGAs. In Proceedings of the 18th ACM Conference on Computer and Communications Security (CCS ’11), Chicago, IL, USA, 17–21 October 2011; pp. 111–124. [Google Scholar] [CrossRef]
Moradi, A.; Oswald, D.; Paar, C.; Swierczynski, P. Side-channel attacks on the bitstream encryption mechanism of Altera Stratix II: Facilitating black-box analysis using software reverse-engineering. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA ’13), Monterey, CA, USA, 11–13 February 2013; pp. 91–100. [Google Scholar] [CrossRef]
Swierczynski, P.; Becker, G.T.; Moradi, A.; Paar, C. Bitstream Fault Injections (BiFI)—Automated Fault Attacks Against SRAM-Based FPGAs; Cryptology ePrint Archive, Paper 2016/641. 2016. Available online: https://eprint.iacr.org/2016/641 (accessed on 15 September 2025).
Johnson, A.P.; Patranabis, S.; Chakraborty, R.S.; Mukhopadhyay, D. Remote dynamic partial reconfiguration: A threat to Internet-of-Things and embedded security applications. Microprocess. Microsyst. 2017, 52, 131–144. [Google Scholar] [CrossRef]
Maes, R. Physically Unclonable Functions: Constructions, Properties and Applications; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar] [CrossRef]
Van Herrewege, A. Lightweight PUF-Based Key and Random Number Generation. Ph.D. Thesis, KU Leuven, Leuven, Belgium, 2015. Available online: https://lirias.kuleuven.be/handle/123456789/469975 (accessed on 15 September 2025).
Ender, M.; Moradi, A.; Paar, C. The Unpatchable Silicon: A Full Break of the Bitstream Encryption of Xilinx 7-Series FPGAs. In Proceedings of the 29th USENIX Security Symposium (USENIX Security 20), Boston, MA, USA, 12–14 August 2020; pp. 1803–1819. Available online: https://www.usenix.org/conference/usenixsecurity20/presentation/ender (accessed on 15 September 2025).
Kataria, J.; Housley, R.; Pantoga, J.; Cui, A. Defeating Cisco trust anchor: A case-study of recent advancements in direct FPGA bitstream manipulation. In Proceedings of the 13th USENIX Workshop on Offensive Technologies (WOOT ’19), Santa Clara, CA, USA, 12 August 2019; Available online: https://www.usenix.org/conference/woot19/presentation/kataria (accessed on 15 September 2025).
Ender, M.; Swierczynski, P.; Wallat, S.; Wilhelm, M.; Knopp, P.M.; Paar, C. Insights into the mind of a trojan designer: The challenge to integrate a trojan into the bitstream. In Proceedings of the 24th Asia and South Pacific Design Automation Conference (ASPDAC ’19), Tokyo, Japan, 21–24 January 2019; pp. 112–119. [Google Scholar] [CrossRef]
Samir, N.; Gamal, Y.; El-Zeiny, A.; Mahmoud, O.; Shawky, A.; Saeed, A. Energy-Adaptive Lightweight Hardware Security Module using Partial Dynamic Reconfiguration for Energy Limited Internet of Things Applications. In Proceedings of the 2019 IEEE International Symposium on Circuits and Systems (ISCAS), Sapporo, Japan, 26–29 May 2019; pp. 1–4. [Google Scholar] [CrossRef]
Wei, Z.; Cui, Y.; Chen, Y.; Wang, C.; Gu, C.; Liu, W. Transformer PUF: A Highly Flexible Configurable RO PUF Based on FPGA. In Proceedings of the 2020 IEEE Workshop on Signal Processing Systems (SiPS), Coimbra, Portugal, 20–22 October 2020; pp. 1–6. [Google Scholar] [CrossRef]
Sunkavilli, S.; Chennagouni, N.G.; Yu, Q. DPReDO: Dynamic Partial Reconfiguration enabled Design Obfuscation for FPGA Security. In Proceedings of the 2022 IEEE 35th International System-on-Chip Conference (SOCC), Belfast, UK, 5–8 September 2022; pp. 1–6. [Google Scholar] [CrossRef]
Sasdrich, P.; Moradi, A.; Mischke, O.; Güneysu, T. Achieving side-channel protection with dynamic logic reconfiguration on modern FPGAs. In Proceedings of the 2015 IEEE International Symposium on Hardware Oriented Security and Trust (HOST), Washington, DC, USA, 5–7 May 2015; pp. 130–136. [Google Scholar] [CrossRef]
Bommana, S.R.; Veeramachaneni, S.; Ershad, S.; Srinivas, M.B. Mitigating Side Channel Attacks on FPGA through Deep Learning and Dynamic Partial Reconfiguration. Sci. Rep. 2025, 15, 13745. [Google Scholar] [CrossRef]
Janakiraman, S.; Vinoth Raj, R.; Sivaraman, R.; Sridevi, A.; Upadhyay, H.N.; Amirtharajan, R. Integrity-verified lightweight ciphering for secure medical image sharing between embedded SoCs. Sci. Rep. 2025, 15, 7465. [Google Scholar] [CrossRef]
Kharidu, H.K.; Sudha, V. FPGA implementation of EEG based hardware optimized data encryption technique for IoT applications. Integration 2025, 102, 102381. [Google Scholar] [CrossRef]
Shafique, A.; Naqvi, S.A.A.; Raza, A.; Ghalaii, M.; Papanastasiou, P.; McCann, J.; Abbasi, Q.H.; Imran, M.A. A hybrid encryption framework leveraging quantum and classical cryptography for secure transmission of medical images in IoT-based telemedicine networks. Sci. Rep. 2024, 14, 31054. [Google Scholar] [CrossRef]
Sivaranjani Devi, C.; Amirtharajan, R. A novel 2D MTMHM based key generation for enhanced security in medical image communication. Sci. Rep. 2025, 15, 25411. [Google Scholar] [CrossRef]
Maazouz, M.; Toubal, A.; Bengherbia, B.; Houhou, O.; Batel, N. FPGA implementation of a chaos-based image encryption algorithm. J. King Saud Univ. Comput. Inf. Sci. 2022, 34, 9926–9941. [Google Scholar] [CrossRef]
Ciylan, F.; Ciylan, B.; Atak, M. FPGA-based chaotic image encryption using systolic arrays. Electronics 2023, 12, 2729. [Google Scholar] [CrossRef]
Ekdahl, P.; Johansson, T. Another attack on A5/1. IEEE Trans. Inf. Theory 2003, 49, 284–289. [Google Scholar] [CrossRef]
Güneysu, T.; Kasper, T.; Novotný, M.; Paar, C.; Rupp, A. Cryptanalysis with COPACOBANA. IEEE Trans. Comput. 2008, 57, 1498–1513. [Google Scholar] [CrossRef]
Chakraborty, R.S.; Saha, I.; Palchaudhuri, A.; Naik, G.K. Hardware Trojan Insertion by Direct Modification of FPGA Configuration Bitstream. IEEE Des. Test 2013, 30, 45–54. [Google Scholar] [CrossRef]
Adetomi, A.; Enemali, G.; Arslan, T. Relocating Encrypted Partial Bitstreams by Advance Task Address Loading. In Proceedings of the 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), Napa, CA, USA, 30 April–2 May 2017; pp. 188–191. [Google Scholar] [CrossRef]
Moraitis, G.; Dubrova, E. Bitstream Modification Attack on SNOW 3G; Cryptology ePrint Archive, Paper 2020/038. 2020. Available online: https://eprint.iacr.org/2020/038 (accessed on 15 September 2025).
Albartus, N.; Hoffmann, M.; Temme, S.; Azriel, L.; Paar, C. DANA: Universal dataflow analysis for gatelevel netlist reverse engineering. IACR Trans. Cryptogr. Hardw. Embed. Syst. 2020, 2020, 309–336. [Google Scholar] [CrossRef]
Nabeel, N.; Habaebi, M.H.; Islam, M.D.R. Security Analysis of LNMNT-LightWeight Crypto Hash Function for IoT. IEEE Access 2021, 9, 165754–165765. [Google Scholar] [CrossRef]
Yao, L.; Liang, H.; Han, Q.; Zhang, H.; Huang, Z.; Jiang, C.; Yi, M.; Lu, Y. M-RO PUF: A portable pure digital RO PUF based on MUX unit. Microelectron. J. 2022, 119, 105314. [Google Scholar] [CrossRef]
Dworkin, M. Recommendation for Block Cipher Modes of Operation: Galois/Counter Mode (GCM) and GMAC (NIST SP 800-38D); Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2007. Available online: https://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-38d.pdf (accessed on 15 September 2025).
Xilinx. Zynq-7000 All Programmable SoC Technical Reference Manual (UG585); Xilinx: San Jose, CA, USA, 2018; Available online: https://docs.amd.com/r/en-US/ug585-zynq-7000-SoC-TRM (accessed on 15 September 2025).
Vipin, K.; Fahmy, S.A. Efficient region allocation for adaptive partial reconfiguration. In Proceedings of the 2011 International Conference on Field-Programmable Technology, New Delhi, India, 12–14 December 2011; pp. 1–6. [Google Scholar] [CrossRef]
ARM Ltd. ARM Cortex-A9 MPCore Technical Reference Manual (r3p0); ARM Ltd.: Cambridge, UK, 2010; Available online: https://developer.arm.com/documentation/ddi0407/latest/ (accessed on 15 September 2025).
AMD. Vivado Design Suite User Guide: Dynamic Function eXchange (UG909); AMD: Santa Clara, CA, USA, 2025; Available online: https://docs.amd.com/r/en-US/ug909-vivado-partial-reconfiguration (accessed on 15 September 2025).
AMD. Vivado Design Suite User Guide: Programming and Debugging (UG908); AMD: Santa Clara, CA, USA, 2025; Available online: https://docs.amd.com/r/en-US/ug908-vivado-programming-debugging (accessed on 15 September 2025).
Blanchet, B. Dealing with Dynamic Key Compromise in CryptoVerif. In Proceedings of the 2024 IEEE 37th Computer Security Foundations Symposium (CSF), Enschede, The Netherlands, 8–12 July 2024; IEEE: Piscataway, NJ, USA; pp. 495–510. [Google Scholar] [CrossRef]
Stallings, W. Cryptography and Network Security: Principles and Practice, 7th ed.; Pearson: Harlow, UK, 2016. [Google Scholar]
Poschmann, A. Lightweight Cryptography—Cryptographic Engineering for a Pervasive World; Cryptology ePrint Archive, Paper 2009/516. 2009. Available online: https://eprint.iacr.org/2009/516 (accessed on 15 September 2025).
Sadeghi, A.R.; Wachsmann, C.; Waidner, M. Security and privacy challenges in Industrial Internet of Things. In Proceedings of the 52nd Annual Design Automation Conference, San Francisco, CA, USA, 7–11 June 2015; pp. 1–6. [Google Scholar] [CrossRef]
Gueron, S. Intel Advanced Encryption Standard (AES) New Instructions Set (Rev. 3.01); Technical Report; Intel Corporation: Santa Clara, CA, USA, 2010; Available online: https://www.intel.com/content/dam/doc/white-paper/advanced-encryption-standard-new-instructions-set-paper.pdf (accessed on 15 September 2025).
Nguyen, H.; Hoang, T.; Tran, L. Efficient hardware implementation of elliptic-curve Diffie–Hellman ephemeral on Curve25519. Electronics 2023, 12, 4480. [Google Scholar] [CrossRef]
Manifavas, H.; Hatzivasilis, G.; Fysarakis, K.; Papaefstathiou, Y. A survey of lightweight stream ciphers for embedded systems. Secur. Commun. Netw. 2015, 9, 1226–1246. [Google Scholar] [CrossRef]
Chen, H.; Che, M.; Seiki, N.; Shiramizu, T.; Yano, T.; Mikami, Y.; Ueda, Y.; Kato, K. Physically encrypted wireless transmission based on XOR between two data in terahertz beams. Electronics 2023, 12, 2629. [Google Scholar] [CrossRef]
Bellare, M.; Namprempre, C. Authenticated encryption: Relations among notions and analysis of the generic composition paradigm. In Advances in Cryptology—ASIACRYPT 2000; Springer: Berlin/Heidelberg, Germany, 2000; pp. 531–545. [Google Scholar] [CrossRef]
Krawczyk, H. The order of encryption and authentication for protecting communications. In Advances in Cryptology—CRYPTO 2001; Springer: Berlin/Heidelberg, Germany, 2001; pp. 310–331. [Google Scholar] [CrossRef]
Rührmair, U.; Sölter, J.; Sehnke, F.; Xu, X.; Mahmoud, A.; Stoyanova, V.; Dror, G.; Schmidhuber, J.; Burleson, W.; Devadas, S. PUF modeling attacks on simulated and silicon data. IEEE Trans. Inf. Forensics Secur. 2013, 8, 1876–1891. [Google Scholar] [CrossRef]
Ning, H.; Farha, F.; Ullah, A.; Mao, L. Physical unclonable function: Architectures, applications and challenges for dependable security. IET Circuits Devices Syst. 2020, 14, 407–424. [Google Scholar] [CrossRef]

Figure 1. Reading the Bits in RAM during a QP-Based Side Channel Attack under Fixed-Location Memory Usage [3] (a) Scraping; (b) RAM; (c) Bits.

Figure 2. HSF–ASCON flow: KDF (ASCON-XOF), Nonce/AAD, ASCON-128a, receiver checks; DFX relocates BRAM-resident secrets and selected code pages (pseudocode is shown in Algorithm 1).

Figure 3. Proposed hardware architecture on Zynq-7020, using DFX to swap BRAM contents. The design contains two reconfigurable regions; a SmartConnect couples these resources to the ARM processing system via the AXI interconnect.

Figure 4. Internal view of a reconfigurable module (RM). Each RM contains an AXI BRAM Controller and a BRAM instance that holds the code and secret pool. During DFX, the BRAM controller and memory are removed and replaced with an alternate instance.

Figure 5. Shunt capture via Digilent Analog Discovery 3 across $R_{s} = 18.408$ Ω; UART, AEAD, and DFX plateaus are annotated. (Note: The shunt capture shows three plateaus: a low-current UART transmit window, a mid-level idle+crypto window covering KDF+AEAD execution, and a short over-current DFX window due to single-shot partial reconfiguration.)

Table 1. A summary of state-of-the-art methods and identified gaps.

Author	Year	Method	Platform	Application	Limitations/Identified Gaps
Biryukov [26]	2001	Real-time TMTO attack (A5/1)	PC	GSM cryptanalysis	Needs large precomputation; attack-focused.
Ekdahl [54]	2003	Correlation attack (A5/1)	PC	GSM cryptanalysis	Requires long keystream; not IP protection.
Güneysu [55]	2008	COPACOBANA HW attack	FPGA cluster	A5/1/DES breaking	Dedicated HW; attack-only; not a defence.
Moradi [34]	2011	Side-channel attacks	Virtex/Stratix	Vendor encryption	Demonstration; no countermeasure.
Maes [38]	2013	PUF-based keys	Virtex-6	Device key binding	Vulnerable post-decryption.
Chakraborty [56]	2013	Bitstream Trojan insertion	Virtex-5	Concept demo	No integrated defence provided.
Van Herrewege [39]	2015	PUF key binding	Artix-7	Anti-cloning	Side-channel vulnerability.
Sasdrich [46]	2015	Moving-target cipher	FPGA	SCA mitigation	Cipher-specific; no general IP protection.
Swierczynski [36]	2016	Fault injection attack	Spartan-6	Key leakage	Physical access required; active attack.
Karam [7]	2016	Dark silicon obfuscation	Cyclone IV	IP protection	Static obfuscation; no dynamic updates.
Johnson [37]	2017	Remote DPR attack	Spartan-6	Trojan insertion	Attack only; no defense.
Adetomi [57]	2017	ATAL bitstream relocation	Virtex-5	Module relocation	No remote update or side-channel protection.
Kataria [41]	2019	Bitstream manipulation	Virtex	Auth. bypass	No built-in defence; bit flipping.
Ender [42]	2019	Automated Trojans	Artix-7	Trojan injection	Tool provided; static obfuscation vulnerable.
Soliman [14]	2019	Algorithm hopping	Zynq-7020	IoT security	No bitstream protection/authentication.
Samir [43]	2019	Energy-adaptive HSM	Zynq-7000	IoT security	No bitstream protection; switching delay risks.
Wei [44]	2020	Dynamic CRO-PUF	Kintex-7	Authentication	Not for functional logic protection.
Ender [40]	2020	Starbleed attack	Xilinx 7-series	Bitstream recovery	Protocol exploit; unpatched devices vulnerable.
Moraitis [58]	2020	LUT modification attack	Spartan-6	Key extraction	LUT modification; logic obfuscation required.
Albartus [59]	2020	DANA netlist analysis	N/A	Reverse engineering	Analysis tool only; no protection.
Stolz [8]	2021	LifeLine co-obfuscation	Zynq-7020	IP protection	Complexity; no encryption.
Faraj [3]	2021	Photonic side-channel	SRAM FPGA	SRAM extraction	Attack only; photonic shielding needed.
Nabeel [60]	2021	Lightweight security module	Zynq-7020	IoT security	Resources not reported; no bitstream protection.
Engels [13]	2022	Logic locking critique	N/A	Security survey	Vulnerable to SAT attacks; no DPR used.
Sunkavilli [45]	2022	DPReDO obfuscation	Ultrascale+	Trojan defence	Prototype; secure DPR required.
Maazouz [52]	2022	Dynamic S-box (4D chaos)	Xilinx FPGA	General encryption	35% LUT; no authentication layer.
Yao [61]	2022	Reconfigurable PUF security	Kintex-7	Authentication	Targets device auth; not code protection.
Ciylan [53]	2023	Systolic-array chaos convol’n	Virtex-7	Image encryption	55% LUT, 280 mW; no integrity/obfuscation.
Proulx [1]	2023	FW/IP protection via DPR	FPGA SoC	IP protection	Dual-core+RP; lacks crypto; SCA risk.
Shafique [50]	2024	Quantum-chaos hybrid	Simulated	Medical imaging	Complex; no hardware; simulation only.
Bommana [47]	2025	DL-based SCA defence	FPGA	SCA mitigation	Energy overhead unknown.
Janakiraman [48]	2025	Lorentz-chaos cipher	PYNQ-Z1	Image encryption	High resource use; no integrity check.
Kharidu [49]	2025	EEG-based PRNG cipher	Custom FPGA	Biometric IoT	Requires EEG input; no auth/integrity.
Sivaranjani [51]	2025	2D-MTMHM chaos map	MATLAB	Image encryption	Software-only; no FPGA; confidentiality only.
Kıyak	2025	Hybrid Security Framework	Zynq-7020	Code theft protection	Requires key-pool storage; DFX complexity.

Table 3. Field semantics and security rationale (symbols are defined as in Figure 2).

Symbol	Scope/Size	Source/Definition	Role & Security Rationale
$S_{ROOT}$ (ROOT)	Secret/256 b	Device-internal root secret	Only input to KDF; never leaves device. Brute-force $\approx 2^{256}$; effective system security bounded by 128-bit AEAD/tag.
POOL[idx] (slice)	Secret/128 b	128-bit window derived from $S_{ROOT}$ by non-sequential offsets	Diversifies KDF input. Windows may overlap; index repeats do not reduce per-packet uniqueness because (epoch, msg) also enter KDF.
idx	Public/8 b (policy)	RNG/CSPRNG or policy counter	Selects slice in POOL. Leakage is benign: without $S_{ROOT}$ the slice value cannot be reconstructed.
dev, dev_N, dev_AAD	Public/32 b (in Nonce/AAD)	Device identifier	Binds packets to device; dev_N must equal dev_AAD. Authenticity is provided by AEAD tag, not secrecy of dev.
epoch, epoch_N	Public/32 b	Monotone epoch counter	Anti-replay bucket. Increment on reboot or persist last seen state.
msg, msg_N, msg_AAD, msg_cntr	Public/64 b	Monotone message counter	Per-epoch freshness; forms Nonce with dev, epoch. msg_N must equal msg_AAD; msg_cntr is local state.
ts (optional)	Public/64 b	Timestamp if policy requires	Optional AAD extension for logging/windowing; not required for security when (epoch,msg) are present.
ver, verB	Public/8 b	Protocol version; verB is receiver policy	Negotiation/compatibility. Mismatch → reject.
feat, featB	Public/8 b (bit mask)	Feature/policy bits; featB is receiver policy	Enables capability gating (e.g., long AAD). Not secrecy-critical; integrity via tag.
Nonce	Public/128 b	pack128(dev, epoch, msg)	AEAD Nonce. Uniqueness per key is mandatory; construction ensures uniqueness under monotone (epoch,msg).
AAD (15/23 B)	Public	ver ‖ dev ‖ idx ‖ msg ‖ feat [‖ ts]	Integrity-protected associated data; any bit flip causes tag failure (INT-CTXT).
$K_{i}$	Secret/128 b	XOF($S_{ROOT}$ ‖ POOL[idx] ‖ dev [‖ epoch ‖ msg])	Per-packet key. Leakage of a single $K_{i}$ does not help on other packets (domain separation on (epoch, msg)).
CT, TAG	Secret output/128 b tag	AEAD(ASCON-128a)	Provides IND-CPA + INT-CTXT. Forgery prob. $\leq 2^{-128}$.
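
The Nonce and AAD layouts in Table 3 can be illustrated with a short packing sketch. The field widths follow the table (dev 32 b, epoch 32 b, msg 64 b; ver, idx, and feat 8 b each; optional ts 64 b). The big-endian byte order, the function names, and the use of SHAKE-128 as a stand-in for Ascon-XOF (which is not part of Python's standard library) are assumptions made for illustration only and are not the authors' implementation.

```python
import hashlib
import struct

def pack_nonce(dev: int, epoch: int, msg: int) -> bytes:
    """pack128(dev, epoch, msg): 32 + 32 + 64 bits = 128-bit nonce (big-endian assumed)."""
    return struct.pack(">IIQ", dev, epoch, msg)

def pack_aad(ver: int, dev: int, idx: int, msg: int, feat: int, ts=None) -> bytes:
    """ver | dev | idx | msg | feat [| ts]: 15 bytes, or 23 bytes with the optional timestamp."""
    aad = struct.pack(">BIBQB", ver, dev, idx, msg, feat)
    if ts is not None:
        aad += struct.pack(">Q", ts)
    return aad

def derive_key(root: bytes, pool_slice: bytes, dev: int, epoch: int, msg: int) -> bytes:
    """Per-packet 128-bit key. SHAKE-128 is only a stand-in XOF here, not Ascon-XOF."""
    material = root + pool_slice + struct.pack(">IIQ", dev, epoch, msg)
    return hashlib.shake_128(material).digest(16)

# Hypothetical example values; the all-zero secrets are placeholders.
nonce = pack_nonce(dev=0x11223344, epoch=1, msg=7)
aad_short = pack_aad(ver=1, dev=0x11223344, idx=5, msg=7, feat=0)
aad_long = pack_aad(ver=1, dev=0x11223344, idx=5, msg=7, feat=0, ts=1_700_000_000)
k7 = derive_key(bytes(32), bytes(16), 0x11223344, 1, 7)
k8 = derive_key(bytes(32), bytes(16), 0x11223344, 1, 8)
print(len(nonce), len(aad_short), len(aad_long), k7 != k8)
```

Because (epoch, msg) enters both the nonce and the key-derivation input, incrementing msg changes the nonce and the derived key together, which is the per-packet uniqueness property the table relies on.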

Table 4. Comparison of configuration access ports for partial reconfiguration.

Interface	Devices	Accessed via	Remarks
PCAP	Zynq-7000	PS (`DEVCFG`)	Full/partial reconfig; encrypted bitstreams; used in this design.
ICAP	All series	PL logic	High-bandwidth in-fabric PR; requires custom ICAP controller; not used.
MCAP	UltraScale(+)	PCIe endpoint	Available only on UltraScale MPSoCs; not supported by Zynq-7020.

Table 5. Characteristics of memory options on Zynq-7020 and rationale for use [63].

Memory	Volatile	Owner	Access	Capacity	Use Case	Rationale
QSPI flash	N	PS	PS/ext	MBs	Bitstream	Non-volatile; accessible externally; used for boot images.
eFUSE/BBRAM	N	PS	Boot ctl	256 bit	Bit enc. key	OTP; too small for list; used for bitstream encryption.
OCM	Y	PS	PS only	256 KiB	Stack/code	Fast on-chip; fixed; used for FSBL boot process.
BRAM (PL)	Y	PL	PS/PL AXI	18-36 KiB/blk	Keys, list	On-chip; DFX-movable; AXI-mapped; used for secrets/pool.
PS DDR3	Y	PS	PS/ext	MBs	App data	Off-chip; bus-probe risk; can be used for non-secret payloads.
UltraRAM	Y	PL	PL only	≥288 KiB	Large buf	Absent on XC7Z020 (UltraScale feature).
Distributed RAM	Y	PL	PL only	few KiB	Tiny buf	LUT-RAM; too small; fixed.
L1/L2 caches	Y	PS	CPU	32-512 KiB	Cache	HW-managed; not addressable.

Table 6. Security-driven memory selection (code theft by scraping and bus probing).

Memory	On-Chip	DFX	Notes
PS DDR	No	No	High capacity but vulnerable to bus probing; not used for plaintext keys/secret-pools.
QSPI	No	No	Holds boot images; never stores secrets in clear.
OCM	Yes	No	On-chip but fixed and PS-only; unsuitable for moving-target defense.
eFUSE/BBRAM	Yes	No	Dedicated to the bitstream AES key (AES-256) for PL decryption; not for dynamic keys [67].
PL BRAM	Yes	Yes	Chosen store for encrypted secret-pools and session material; can be physically relocated and scrubbed via DFX.

Table 7. Zynq-7020: cycles, duration, power/energy (HSF–ASCON and reference AEADs).

Algorithm/Sub-Stages	Cycles	Duration (µs)	I@9.3V (A)	I@12V (A)	P@9.3V (W)	P@12V (W)	Energy (µWh) @9.3/@12
HSF–ASCON-128a	10,400	12.00	0.149	0.125	1.38	1.50	0.0046/0.0050
KDF	5511	6.63	0.149	0.125	1.38	1.50	0.0025/0.0028
AEAD (total)	4375	5.26	0.149	0.125	1.38	1.50	0.0020/0.0022
init	980	1.18	0.149	0.125	1.38	1.50	0.0005/0.0005
aad	1183	1.42	0.149	0.125	1.38	1.50	0.0005/0.0006
msg	934	1.12	0.149	0.125	1.38	1.50	0.0004/0.0005
fin	978	1.18	0.149	0.125	1.38	1.50	0.0005/0.0005
ASCON-128a	5200	6.00	0.149	0.125	1.38	1.50	0.0023/0.0025
AEAD (total)	4793	5.52	0.149	0.125	1.38	1.50	0.0021/0.0023
init	1225	1.41	0.149	0.125	1.38	1.50	0.0005/0.0006
aad	1343	1.55	0.149	0.125	1.38	1.50	0.0006/0.0006
msg	1096	1.25	0.149	0.125	1.38	1.50	0.0005/0.0005
fin	1129	1.26	0.149	0.125	1.38	1.50	0.0005/0.0005
ACORN-128	433,100	500.00	0.149	0.125	1.38	1.50	0.1912/0.2083
AEAD (total)	432,997	499.88	0.149	0.125	1.38	1.50	0.1911/0.2083
init	227,347	262.46	0.149	0.125	1.38	1.50	0.1004/0.1094
aad	50,615	58.43	0.149	0.125	1.38	1.50	0.0223/0.0243
msg	53,078	61.28	0.149	0.125	1.38	1.50	0.0234/0.0255
fin	101,916	117.66	0.149	0.125	1.38	1.50	0.0450/0.0490
TinyJAMBU-128	149,263	172.00	0.149	0.125	1.38	1.50	0.0658/0.0717
AEAD (total)	149,158	171.88	0.149	0.125	1.38	1.50	0.0657/0.0716
init	19,617	22.61	0.149	0.125	1.38	1.50	0.0086/0.0094
aad	29,344	33.81	0.149	0.125	1.38	1.50	0.0129/0.0141
msg	87,877	101.26	0.149	0.125	1.38	1.50	0.0387/0.0422
fin	12,244	14.11	0.149	0.125	1.38	1.50	0.0054/0.0059
JAMBU–PRESENT-128	317,142	366.00	0.149	0.125	1.38	1.50	0.1400/0.1525
AEAD (total)	316,896	365.72	0.149	0.125	1.38	1.50	0.1398/0.1524
init	53,568	61.82	0.149	0.125	1.38	1.50	0.0236/0.0258
aad	105,319	121.54	0.149	0.125	1.38	1.50	0.0465/0.0506
msg	105,407	121.60	0.149	0.125	1.38	1.50	0.0465/0.0507
fin	52,580	60.68	0.149	0.125	1.38	1.50	0.0232/0.0253

Note. Crypto plateaus were measured at $V_{load} \approx 9.265$ V and $I \approx 0.149$ A from the shunt (Analog Discovery 3), giving $P_{load} \approx 1.38$ W. The 12 V column uses the average current measured with the PicoTest multimeter, $I \approx 0.125$ A $\Rightarrow P_{12V} \approx 1.50$ W. Energies are $E_{μWh} = P \cdot Δt_{μs} / 3600$.
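
As a worked check of the energy conversion used in the table notes, the following minimal sketch evaluates $E = P \cdot Δt / 3600$ for rows taken from Tables 7 and 8. The function name is hypothetical; the only assumption is that power is expressed in watts and duration in microseconds, so that W·µs = µJ and 1 µWh = 3600 µJ.

```python
def energy_uwh(power_w: float, duration_us: float) -> float:
    """E[µWh] = P[W] * Δt[µs] / 3600 (W·µs = µJ, and 1 µWh = 3600 µJ)."""
    return power_w * duration_us / 3600.0

# HSF–ASCON-128a row of Table 7: 12.00 µs at 1.38 W (9.3 V) and 1.50 W (12 V)
e_93 = energy_uwh(1.38, 12.00)
e_12 = energy_uwh(1.50, 12.00)
# HSF–ASCON row of Table 8: 962 µs at 21.6 mW (converted to W)
e_due = energy_uwh(21.6e-3, 962)
print(f"{e_93:.4f} {e_12:.4f} {e_due:.5f}")  # prints "0.0046 0.0050 0.00577"
```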

Table 8. Arduino Due (12 V PicoTest): duration and power/energy.

Algorithm/Sub-Stage	Duration (µs)	I@12 V (mA)	P@12 V (mW)	Energy (µWh) @12
HSF–ASCON	962	1.8	21.6	0.00577
parse	486	1.8	21.6	0.00292
KDF	254	1.8	21.6	0.00152
AEAD	222	1.8	21.6	0.00133
ASCON	710	1.8	21.6	0.00426
ACORN-128	21,663	1.8	21.6	0.12998
TinyJAMBU-128	5,302	1.8	21.6	0.03181
JAMBU–PRESENT-128	17,299	1.8	21.6	0.10379

Note. Arduino Due measurements use a 12 V PicoTest aggregate with constant $I = 1.8$ mA ($P = 21.6$ mW); energies follow $E_{μWh} = P \cdot Δt_{μs} / 3600$ with $P$ expressed in W. The HSF–ASCON total and its parse/KDF/AEAD breakdown match the consolidated energy table (e.g., $0.00577$ µWh per operation for HSF–ASCON).

Table 9. Plateau windows on Zynq (shunt method): duration, current, power, and energy.

Window	$Δ t$ (ms)	$V_{shunt}$ (V)	I (mA)	$V_{load}$ (V)	P (W)	E (µWh)
UART (TX)	99.170	2.524	137.114	9.476	1.2993	35.792
HSF (full window)	0.012	2.735	148.577	9.265	1.3766	0.004589
HSF delta (vs UART)	0.012	+0.211	+11.462	−0.211	+0.07727	0.000258
DFX (full window)	180.800	3.489	189.537	8.511	1.6132	81.016
DFX delta (vs idle)	180.800	+0.7466	+40.559	−0.7466	+0.23397	11.750
DFX (2-RM full window)	90.400	3.489	189.537	8.511	1.6132	40.508
DFX delta (2-RM versus idle)	90.400	+0.7466	+40.559	−0.7466	+0.23397	5.875

Note. Crypto and DFX plateaus are evaluated at the measured shunt-derived operating point; energies use per-window $V_{load} = 12.0 - V_{shunt}$. The 2-RM DFX rows are obtained by linear scaling of the single-shot partial reconfiguration window: $t \approx 90.4$ ms, $E_{Δ} \approx 5.875$ µWh, and absolute $E \approx 40.508$ µWh (half of the 4-RM window). Delta rows are calculated differences between states, not direct measurements.
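
The shunt method behind Table 9 can be sketched as follows. The sketch assumes that the current is obtained as $I = V_{shunt} / R_{s}$ with the $R_{s} = 18.408$ Ω value from the Figure 5 caption and that the supply is 12.0 V, as implied by $V_{load} = 12.0 - V_{shunt}$; under these assumptions it reproduces the current, power, and energy columns of the directly measured rows. The function and constant names are hypothetical.

```python
R_SHUNT_OHM = 18.408   # shunt resistance from the Figure 5 caption
V_SUPPLY = 12.0        # supply voltage implied by V_load = 12.0 - V_shunt

def plateau(v_shunt: float, dt_ms: float):
    """Return (I in mA, V_load in V, P in W, E in µWh) for one plateau window."""
    current_a = v_shunt / R_SHUNT_OHM
    v_load = V_SUPPLY - v_shunt
    power_w = v_load * current_a
    energy_uwh = power_w * (dt_ms * 1000.0) / 3600.0   # W·µs / 3600 = µWh
    return current_a * 1000.0, v_load, power_w, energy_uwh

# (window name, V_shunt in V, duration in ms) taken from Table 9
for name, v_shunt, dt_ms in [("UART (TX)", 2.524, 99.170),
                             ("HSF", 2.735, 0.012),
                             ("DFX", 3.489, 180.800)]:
    i_ma, v_load, p_w, e_uwh = plateau(v_shunt, dt_ms)
    print(f"{name:10s} I={i_ma:.1f} mA  V_load={v_load:.3f} V  "
          f"P={p_w:.4f} W  E={e_uwh:.4f} µWh")
```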

Table 10. The energy/latency of code-protection schemes (shunt-only, algorithm phase).

Study	Platform	Application	Energy/Power	Latency	Notes
Soliman [14]	Zynq-7020	5-cipher hop	10 mW dyn.	10 ms/sw.	DPR-based cipher hopping; requires extra bitstreams
Samir [43]	Zynq-7000	Pwr-adapt. HSM	≤10 mW dyn.	50–100 ms/sw.	Energy-adaptive AEAD selection; no bitstream defence
JAMBU-128 (PRESENT)	Zynq-7020	AEAD (1×16 B)	0.139951 µWh/msg	366 µs	AEAD-only
T-JAMBU-128	Zynq-7020	AEAD (1×16 B)	0.065769 µWh/msg	172 µs	AEAD-only
ACORN-128	Zynq-7020	AEAD (1×16 B)	0.191189 µWh/msg	500 µs	AEAD-only
Ascon-128a	Zynq-7020	AEAD (1×16 B)	0.002294 µWh/msg	6 µs	AEAD-only
HSF-ASCON (No DFX)	Zynq-7020	Comm. resistance	0.004589 µWh/msg	12 µs	HSF wrapper (XOF KDF, nonce, AAD)
HSF-ASCON (+DFX, 4-RM)	Zynq-7020	Crypto + DFX	11.755 µWh/msg	12 µs + 180.8 ms	HSF + DFX (secrets + code); additional cost
HSF-ASCON (+DFX, 2-RM)	Zynq-7020	Crypto + DFX (2-RM)	5.875 µWh/msg	12 µs + 90.4 ms	HSF + DFX (single-shot 2-RM); additional cost

Note. DFX rows scale linearly with the number of partial reconfiguration swaps per message; per-swap values are taken from Table 9: for the 4-RM bitstream, $t = 180.8$ ms and $E_{Δ} = 11.75$ µWh (absolute $81.016$ µWh). For a 2-RM single shot these are approximately halved ($t \approx 90.4$ ms, $E_{Δ} \approx 5.875$ µWh).
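
The linear scaling stated in the note can be written as a small cost model. This is a sketch under stated assumptions: the constants are the Table 9 values, the function name is hypothetical, and allowing a fractional number of swaps per message (one swap amortized over several messages) is an illustrative extension rather than a configuration reported in the tables.

```python
HSF_CRYPTO_UWH = 0.004589   # HSF–ASCON crypto energy per message (Table 9)
HSF_CRYPTO_US = 12.0        # HSF–ASCON crypto latency per message
DFX_4RM_DELTA_UWH = 11.75   # incremental energy of one 4-RM swap (Table 9)
DFX_4RM_MS = 180.8          # duration of one 4-RM swap

def per_message_cost(swaps_per_msg: float, rm_count: int = 4):
    """Linear model: crypto cost plus swaps * per-swap DFX cost, scaled by rm_count / 4.

    Returns (energy in µWh, latency in ms).
    """
    scale = rm_count / 4.0
    energy_uwh = HSF_CRYPTO_UWH + swaps_per_msg * DFX_4RM_DELTA_UWH * scale
    latency_ms = HSF_CRYPTO_US / 1000.0 + swaps_per_msg * DFX_4RM_MS * scale
    return energy_uwh, latency_ms

print(per_message_cost(1, rm_count=4))      # one 4-RM swap on every message
print(per_message_cost(1, rm_count=2))      # one 2-RM swap on every message
print(per_message_cost(1 / 100, rm_count=4))  # assumed: one swap per 100 messages
```

With one 4-RM swap per message the model gives about 11.755 µWh, matching the corresponding row of Table 10; the crypto share is below 0.1% of that total, so the swap rate dominates the energy budget.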

Table 11. Data Limits.

Payload Size	Blocks/msg	Max Messages (at $2^{29}$ Blocks)
64 B	8	$2^{29} / 8$ = 67,108,864
256 B	32	$2^{29} / 32$ = 16,777,216
1 KB	128	$2^{29} / 128$ = 4,194,304
4 KB	512	$2^{29} / 512$ = 1,048,576
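
The entries of Table 11 follow from integer division of the block budget by the number of blocks per message. The sketch below assumes the 8-byte block size implied by the table (64 B corresponds to 8 blocks) and treats 1 KB as 1024 B; the names are hypothetical.

```python
BLOCK_BYTES = 8          # block size implied by Table 11 (64 B -> 8 blocks)
BLOCK_BUDGET = 2 ** 29   # per-key block budget used in Table 11

def max_messages(payload_bytes: int) -> int:
    """Number of messages of the given payload size that fit in the block budget."""
    blocks_per_msg = -(-payload_bytes // BLOCK_BYTES)   # ceiling division
    return BLOCK_BUDGET // blocks_per_msg

for size in (64, 256, 1024, 4096):
    print(size, max_messages(size))
```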

Table 12. Cipher types and AEAD status.

Scheme	Type	Tag	AEAD?
ASCON-128a	Sponge AEAD	128-bit	✓
ACORN-128	Stream AEAD	128-bit	✓
TinyJAMBU-128	Lightweight AEAD	128-bit	✓
PRESENT-128 (ECB)	Block cipher core	–	×
JAMBU-PRESENT-128	Mode+block-cipher (AEAD)	64-bit	✓

Table 13. Attack Experiment 1: brute-force verification throughput on Arduino Due (DFX disabled).

Device	AEAD Verify (µs/Cycles)	BF Tries	BF Avg Rate (/s)
Due @ 84 MHz	224/18,816	43,977	4397.7

Note. The row reports verification throughput only (no successes over a 128-bit tag/key space). At $\approx 4.4 \times 10^{3}$ attempts/s, the expected time-to-success for a uniform 128-bit space remains astronomical (ETA (50%) $\approx 1.2 \times 10^{27}$ years).
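The order of magnitude in the note can be recovered from the measured rate alone. The sketch below is a back-of-the-envelope check, not the authors' procedure: it assumes a uniform 128-bit space, a constant rate equal to the Table 13 average, and a Julian year of 365.25 days. Function names are hypothetical.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600
MEASURED_RATE = 4397.7  # attempts per second, from Table 13


def eta_years(bits: int, rate_per_s: float, fraction: float = 0.5) -> float:
    """Years needed to try `fraction` of a uniform 2**bits space at a fixed rate."""
    tries = fraction * 2 ** bits
    return tries / rate_per_s / SECONDS_PER_YEAR


if __name__ == "__main__":
    print(f"ETA (50%) = {eta_years(128, MEASURED_RATE):.2e} years")
```

Covering half of the space is used here as the 50% point; the result lands on the same $10^{27}$-year scale quoted in the note.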

Table 14. Attack Experiment 2: leakage isolation on Arduino Due (DFX disabled).

Scenario	AEAD Verify (µs/Cycles)	OK/10,000	Time/10,000 (s)
Plain ASCON, leaked K	224/18,816	10,000/10,000	2.270
HSF–ASCON, leaked $K_{i}$	224/18,816	1/10,000	2.265

Note. The 10,000-packet times follow the measured $\approx 226.5$ µs/packet. Because every packet is verified with no early exit, total run-time scales as $\approx 226.5$ µs $\times N$ regardless of the number of successes; for $N = 10{,}000$, this projects to $\approx 2.27$ s. A single leaked session key $K$ compromises all packets in the plain design, whereas a single leaked per-message key $K_{i}$ compromises only its packet under HSF, demonstrating damage isolation.
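The damage-isolation effect in Table 14 can be illustrated with a small simulation. This is a sketch under loudly stated assumptions, not the experiment's code: SHAKE-128 stands in for the Ascon-XOF KDF, a truncated HMAC-SHA256 stands in for the Ascon AEAD tag, and a simple message index replaces the rotating secret-pool slice. All names, the secret, and the device identifier are hypothetical.

```python
import hashlib
import hmac


def derive_key(secret: bytes, device_id: bytes, index: int) -> bytes:
    """Stand-in per-message KDF (SHAKE-128, NOT the Ascon-XOF KDF of the paper)."""
    material = b"hsf-demo|" + secret + b"|" + device_id + b"|" + index.to_bytes(8, "big")
    return hashlib.shake_128(material).digest(16)


def make_tag(key: bytes, index: int, payload: bytes) -> bytes:
    """Stand-in authenticator (truncated HMAC-SHA256, NOT Ascon AEAD)."""
    return hmac.new(key, index.to_bytes(8, "big") + payload, hashlib.sha256).digest()[:16]


def accepted_with_leaked_key(n: int, per_message_keys: bool, leaked_index: int = 0) -> int:
    """Count packets whose tag verifies under the single key the attacker holds."""
    secret, device_id = b"demo-secret-pool", b"device-0001"  # hypothetical values
    session_key = derive_key(secret, device_id, 0)

    def key_for(i: int) -> bytes:
        return derive_key(secret, device_id, i) if per_message_keys else session_key

    leaked = key_for(leaked_index)
    ok = 0
    for i in range(n):
        payload = b"packet-%d" % i
        tag = make_tag(key_for(i), i, payload)
        # Every packet is checked; there is no early exit, as in the experiment.
        if hmac.compare_digest(tag, make_tag(leaked, i, payload)):
            ok += 1
    return ok


if __name__ == "__main__":
    n = 10_000
    print("plain, leaked K:     ", accepted_with_leaked_key(n, per_message_keys=False), "/", n)
    print("per-message, leaked K_i:", accepted_with_leaked_key(n, per_message_keys=True), "/", n)
```

With a shared session key every packet verifies under the leaked key, while with per-message keys only the packet at `leaked_index` does, mirroring the 10,000/10,000 versus 1/10,000 split in the table.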

Table 15. Attack-resilience summary for the proposed method.

Attack Method	Effectiveness	Resilience
Brute Force Attack	Very Low	Very High
Side-Channel Attack	Low	Very High
Timing Attack	Low	High
Hardware Manipulation (HTH)	Medium	High
Dynamic Key Resolution Attack	Very Low	Very High
Dictionary Attack	Low	Very High

Table 16. A head-to-head comparison on the evaluated platforms (replaces the former DH table).

Criteria	ASCON-128a	HSF–ASCON	HSF–ASCON (+DFX, 2-RM)
Producer latency (16 B)	≈6 µs	≈12 µs	≈12 µs + 90.4 ms (PR)
Producer energy	≈0.002294 µWh	≈0.004589 µWh	≈0.004589 µWh + 5.875 µWh (PR Δ)
Solver cost (Due verify)	≈222 µs (AEAD)	≈476 µs (without parse)	same as crypto-only
Keying policy	Long-lived/session key	Per-message (dom-separated)	Per-message (dom-separated)
Damage if a key leaks	Multi-packet exposure	Single-packet exposure	Single-packet exposure
Physical layout stability	Fixed	Fixed (DFX optional)	Moving target (BRAM relocation)
Implementation delta	—	+KDF, pool/index logic	+KDF/pool + DFX/bitstreams

Notes. PR entries reflect the measured 2-RM single-shot window; the 4-RM stress configuration measured $\approx 180.8$ ms and 11.75 µWh (Table 9). Crypto-only figures come from Table 7, Table 8 and Table 10. Bold values are used to emphasize the main performance metrics or key results discussed in the text.

Table 17. Zynq-7020 ($1 \times 16$ B): crypto-only latency/energy and evaluated compromise scope.

Method	Latency (µs)	Energy (µWh)	Throughput Class	Evaluated Compromise Scope^†
HSF–ASCON-128a (XOF+AEAD)	12	0.004589	µs/µWh	Single packet (per-message key)
ASCON-128a (AEAD)	6	0.002294	µs/µWh	All packets in session (single session key)
TinyJAMBU-128	172	0.065769	sub-ms/$10^{-2}$ µWh	Session-scoped in this evaluation
JAMBU–PRESENT-128	366	0.139951	sub-ms/$10^{-1}$ µWh	Session-scoped in this evaluation
ACORN-128	500	0.191189	sub-ms/$10^{-1}$ µWh	Session-scoped in this evaluation

^† Scope reflects the harness configuration used for the attack study: the AEAD baselines were measured with a single session key for comparability, whereas HSF enforces per-message keys by design. The baselines can also be deployed with per-message keys as a policy choice. Energies and latencies are taken from Table 7.
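To put the Table 17 figures on a common footing, the sketch below expresses each scheme relative to plain ASCON-128a and converts the energies to microjoules. It is a reading aid built only from the tabulated values; the dictionary and function names are hypothetical, and the only outside fact used is the unit identity 1 Wh = 3600 J.

```python
# (latency in us, energy in uWh) per 1x16 B message, copied from Table 17.
TABLE_17 = {
    "HSF-ASCON-128a": (12, 0.004589),
    "ASCON-128a": (6, 0.002294),
    "TinyJAMBU-128": (172, 0.065769),
    "JAMBU-PRESENT-128": (366, 0.139951),
    "ACORN-128": (500, 0.191189),
}

UJ_PER_UWH = 3600.0  # 1 Wh = 3600 J, so 1 uWh = 3600 uJ


def relative_to(baseline: str) -> dict:
    """Return (latency ratio, energy ratio) of every scheme against the baseline."""
    base_lat, base_energy = TABLE_17[baseline]
    return {name: (lat / base_lat, energy / base_energy)
            for name, (lat, energy) in TABLE_17.items()}


if __name__ == "__main__":
    for name, (lat_x, energy_x) in relative_to("ASCON-128a").items():
        energy_uj = TABLE_17[name][1] * UJ_PER_UWH
        print(f"{name:<18} {lat_x:6.1f}x latency  {energy_x:6.1f}x energy  {energy_uj:8.2f} uJ")
```

Read this way, the HSF wrapper costs roughly twice plain ASCON-128a in both latency and energy, and remains more than an order of magnitude below TinyJAMBU-128 on both axes.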

Table 18. Energy–performance–security comparison (incl. SCA/MITM/Trojan resistance).

Method	Energy Efficiency	Latency	Security Strength	Advantages	Limitations
Algorithm hopping [14]	Moderate (10 mW dynamic)	Moderate (10 ms/switch)	Medium (cipher diversity)	Cipher agility; efficient resource use	Multiple bitstreams needed; no reconfiguration auth.
Energy-adaptive HSM [43]	Moderate (≤10 mW dynamic)	High (50–100 ms/switch)	Medium (adaptive security)	Power-budget adaptive encryption	No bitstream/key security; exploitable latency
LifeLine co-obfuscation [8]	High (5% overhead)	Low (microseconds)	High (HW/SW split)	Strong obfuscation; minimal crypto overhead	Trusted CPU required; no dynamic security
Dark-silicon obfuscation [7]	High (2% overhead)	Low (13% delay overhead)	Medium (static obfuscation)	Low area overhead, compatible	Static key; vulnerable to SCA; no runtime flexibility
DPReDO [45]	High (minimal overhead)	Low (microseconds)	High (obfuscation)	Real-time Trojan mitigation; low overhead	Prototype stage; assumes secure reconfiguration port
HSF–ASCON (with DFX 2-RM) (this study)	Very high (5.875 µWh/msg)	Low (12 µs + 90.4 ms)	High (AEAD, DFX)	Low power, resistant to SCA and MITM	DFX complexity; storage for encrypted pool; DFX energy window

Crypto-only energy on Zynq-7020: $E_{HSF-ASCON} \approx 0.004589$ µWh per $1 \times 16$ B; DFX single-shot (2-RM) derived from Table 9.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kıyak, C.B.; Bilge, H.Ş.; Yılmaz, F. A Hybrid Security Framework with Energy-Aware Encryption for Protecting Embedded Systems Against Code Theft. Electronics 2025, 14, 4395. https://doi.org/10.3390/electronics14224395
