Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things

Zhu, Xudong; Li, Hui

doi:10.3390/app15105472

Open AccessArticle

Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things

by

Xudong Zhu

^1,* and

Hui Li

^2,*

¹

School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, China

²

School of Cyber Engineering, Xidan University, Xi’an 710126, China

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2025, 15(10), 5472; https://doi.org/10.3390/app15105472

Submission received: 28 March 2025 / Revised: 2 May 2025 / Accepted: 3 May 2025 / Published: 13 May 2025

Download

Browse Figures

Versions Notes

Abstract

The Internet of Medical Things (IoMT) creates interconnected networks of smart medical devices, utilizing extensive medical data collection to improve patient outcomes, streamline resource management, and guarantee comprehensive life-cycle security. However, the private nature of medical data, coupled with strict compliance requirements, has resulted in the separation of information repositories in the IoMT network, severely hindering protected inter-domain data cooperation. Although current blockchain-based federated learning (BFL) approaches aim to resolve these issues, two persistent security weaknesses remain: privacy leakage and poisoning attacks. This study proposes a privacy-preserving poisoning-resistant blockchain-based federated learning (PPBFL) scheme for secure IoMT data sharing. Specifically, we design an active protection framework that uses a lightweight

(t, n)

-threshold secret sharing scheme to protect devices’ privacy and prevent coordination edge nodes from colluding. Then, we design a privacy-guaranteed cosine similarity verification protocol integrated with secure multi-party computation techniques to identify and neutralize malicious gradients uploaded by malicious devices. Furthermore, we deploy an intelligent aggregation system through blockchain smart contracts, removing centralized coordination dependencies while guaranteeing auditable computational validity. Our formal security analysis confirms the PPBFL scheme’s theoretical robustness. Comprehensive evaluations across multiple datasets validate the framework’s operational efficiency and defensive capabilities.

Keywords:

Internet of Medical Things (IoMT); data sharing; federated learning; blockchain; privacy-preserving; poisoning-resistant

1. Introduction

The Internet of Medical Things (IoMT)—a subset of the Internet of Things (IoT)—comprises interconnected smart medical devices, wearable health gadgets, and other medical hardware [1,2]. The widespread adoption of IoMT enables the continuous transfer and exchange of patient medical data among medical institutions, healthcare professionals, devices, and applications. However, due to competitive pressures, medical institutions are often reluctant to share their medical data with others. The transmission of patient health information, which includes sensitive personal and institutional details, poses risks to individual privacy, institutional security, and potentially national security. To overcome these challenges and harness the full potential of IoMT data, secure and trusted data sharing approaches are urgently needed.

Federated learning (FL) represents an innovative approach to machine learning that enables joint model development across decentralized datasets owned by multiple devices, while ensuring that the data are still stored on local devices [3]. This framework achieves knowledge exchange through the periodic aggregation of updated model parameters from participating devices. However, existing federated learning frameworks still face security vulnerabilities, particularly due to the centralized server architecture’s susceptibility to targeted attacks on critical nodes, which may result in unauthorized access to sensitive training datasets [4]. Blockchain’s decentralization, immutability, and traceability [5] have been leveraged for integration with FL frameworks for model parameter aggregation, replacing the traditional central server [6,7]. Blockchain-based federated learning (BFL) enhances the reliability and scalability of FL by mitigating single points of failure (SPFs) and improving communication efficiency [8,9]. Warnat-Herresthal et al. [10] introduced a BFL approach called Swarm Learning (SL), which combines blockchain and FL to collaboratively train models for the diagnosis of multiple diseases while safeguarding patient data privacy and security. However, maintaining gradient data as transactional entries within BFL systems can create security vulnerabilities. Studies have revealed that malicious actors infiltrating blockchain records could launch inference-based assaults to reconstruct sensitive training datasets from exposed gradients [11].

To mitigate these risks, privacy-preserving federated learning (PPFL) frameworks have gained substantial attention across research and industrial domains [12]. The PPFL framework predominantly employs three core methodologies: differential privacy (DP) mechanisms, homomorphic encryption (HE) protocols, and secure multi-party computation (MPC) techniques. Biscotti [13] combines DP with secure aggregation protocols [14] to fortify BFL architectures against both data poisoning attempts and gradient-based inference breaches. Chen et al. [15] illustrated how swarm learning updates can implement partial HE to safeguard BFL systems from inference threats, although this method suffers from operational inefficiencies due to computational complexities in cipher-text processing. MPC-based privacy preservation [16] achieves data indistinguishability through secret sharing mechanisms, but requires significant communication resources due to intensive client–server coordination demands. However, existing PPFL methods primarily ensure the indistinguishability of private information while often neglecting the threat of adversarially manipulated gradients. Attackers can craft malicious gradients to undermine aggregation results and disrupt the training process. Several aggregation rules have been developed to counteract poisoning attacks [17,18,19], which filter out malicious gradients by distinguishing them from legitimate ones. For example, Krum [17] identifies and excludes poisonous gradients by measuring their Euclidean distance from benign gradients. However, these poisoning defense strategies inherently conflict with privacy-preserving FL, which seeks to maintain the indistinguishability of private gradients.

To simultaneously address both poisoning attacks and privacy protection in FL, we propose a novel BFL framework specifically designed for secure medical data exchange within smart healthcare ecosystems. Our contributions are as follows:

A robust global model is maintained through rigorous cosine similarity analysis to filter out harmful gradients, effectively mitigating the risks of data poisoning attacks. This approach enhances model integrity by systematically identifying and removing manipulated parameter updates during distributed training processes.
We propose an actively secure MPC framework that maintains data confidentiality throughout FL processes, effectively addressing the inherent tension between data privacy preservation and robust defense against poisoning attacks.
We establish a hierarchical framework to mitigate poisoning attacks and protect the data privacy of devices while using blockchain, in order to facilitate transparent processes and the enforcement of regulations. This innovative framework facilitates the formation of an autonomously coordinated learning alliance, eliminating the need for a central coordinating authority during model training processes.
Through conducting rigorous evaluations across two widely-accepted benchmark datasets, our proposed framework exhibits superior performance compared to existing cutting-edge methodologies, in terms of operational stability and computational effectiveness.

The remainder of this paper is organized as follows: Section 2 introduces foundational concepts. Section 3 surveys the existing literature. Section 4 describes the proposed framework and its objectives. Section 5 elaborates on the developed approach. Section 6 and Section 7 examine security evaluation and efficiency assessment, respectively. Finally, Section 8 summarizes the key findings and conclusions of the study.

2. Background

Our work is closely related to data sharing in the IoMT, federated learning (FL), blockchain-based federated learning (BFL), and Actively Secure Evaluation Protocol, and we provide relevant background knowledge in this section.

2.1. Data Sharing in IoMT

The Internet of Medical Things (IoMT) facilitates the interconnection of communication-enabled medical-grade devices and their integration into wider-scale health networks in order to improve patients’ health. As shown in Figure 1, patient health data are continually transferred and exchanged between medical institutions, medical workers, medical devices, and medical applications in the IoMT context.

However, IoMT entities often lack sufficient trust, and there is some skepticism regarding IoMT data-sharing platforms. This mistrust creates huge obstacles for IoMT-based data sharing. Therefore, there is an urgent need for IoMT data sharing approaches to fully utilize the potential value of IoMT data.

2.2. Federated Learning (FL) for Data Sharing in the IoMT

As depicted in Figure 2, federated learning (FL) operates as a collaborative machine learning framework in which multiple edge devices jointly develop shared predictive models without having to exchange their private datasets.

Consider a network of N distributed nodes represented by

{C_{1}, C_{2}, \dots, C_{N}}

, each possessing a distinct local training dataset

D_{i}

,

i = {1, 2, \dots, N}

. These participants aim to collaboratively optimize a shared global model w while maintaining strict data confidentiality for their respective information sources. During the

t^{th}

training cycle, the FL process unfolds through three sequential stages:

Step I:: The central server distributes the updated global model $w^{(t - 1)}$ to all participating devices through network transmission.
Step II:: Every participating device develops an individual model through training on its own dataset. In particular, the device $C_{i}$ solves an optimization problem $w_{i}^{(t)} = a r g m i n L (D_{i}, w^{(t - 1)})$ to update the local model $w_{i}^{(t)}$ using stochastic gradient descent, where $L (\cdot)$ is the loss function. These edge devices subsequently transmit their local model updates $g_{i}^{(t)} = w^{(t - 1)} - w_{i}^{(t)}$ to the central server.
Step III:: The server synthesizes these distributed parameters through aggregation algorithms to formulate an enhanced global model $g^{(t)} = \sum_{i = 1}^{n} \frac{| D_{i} |}{S} g_{i}^{(t)}$ , where $| D_{i} |$ is the size of the $i^{th}$ device’s local training dataset, and S is the total number of training examples. Next, the server updates the global model via

$w^{(t)} \leftarrow w^{(t - 1)} - η \cdot g^{(t)},$

(1)

where $η$ is the global learning rate. This global model is then redistributed to all connected devices for subsequent optimization cycles.

2.3. Blockchain-Based Federated Learning (BFL) for Data Sharing in IoMT

A key limitation of federated learning (FL) lies in its reliance on a central server, which significantly impacts the privacy and performance of all devices within the system. In order to address this issue by reducing the network’s dependence on a singular node and improving communication efficiency, blockchain-based federated learning (BFL) leverages blockchain technology to eliminate the necessity of a central server. Figure 3 illustrates how BFL handles model updates by treating them as data within a block, which is shared through a consensus mechanism.

The blockchain is initiated with a single block that includes the initial global model. On the device side, the local model is initialized with the global model

g^{(t)}

and trained on the data

D_{i}

from the

i^{th}

device. Subsequently, the device uploads data to the blockchain by constructing a block comprising a header, the trained model, and the uploader’s identifier (ID). Devices engage directly with a group of miners to acquire the Merkle root of the data before transmitting the block. A smart contract (SC) on the blockchain enforces specific rules to decide when to update the global model. The aggregation mechanisms of BCFL employ an averaging function akin to federated averaging (FedAVG).

2.4. Cosine Similarity

To distinguish between truthful and deceitful gradient vectors, we employ cosine similarity—a widely used metric for quantifying the alignment between two vectors—to assess the directional consistency between a local model gradient (denoted as

g_{i}^{(t)}

) and the aggregated sum gradient vector

G_{s}^{(t)} = \sum_{i = 1}^{N} g_{i}^{(t)}

. Namely,

c s_{i}^{(t)} = c o s θ_{i}^{(t)} = \frac{< g_{i}^{(t)}, G_{s}^{(t)} >}{| | g_{i}^{(t)} | | \cdot | | G_{s}^{(t)} | |} = < \frac{g_{i}^{(t)}}{| | g_{i}^{(t)} | |}, \frac{G_{s}^{(t)}}{| | G_{s}^{(t)} | |} >,

(2)

where

G_{s}^{(t)}

guides the global decent to an appropriate direction,

| | \cdot | |

denotes the

l_{2}

-norm, and

<, >

signifies the dot product between vectors. The cosine similarity metric for

G_{s}^{(t)}

and the gradient

g_{i}^{(t)}

calculates the ratio of their inner product to the product of their respective magnitudes, mathematically expressed as

c o s θ_{i}^{(t)} = \frac{< g_{i}^{(t)}, G_{s}^{(t)} >}{| | g_{i}^{(t)} | | \cdot | | G_{s}^{(t)} | |}

.

2.5. Actively Secure Evaluation Protocol

An MPC-based protocol can ensure the privacy of devices against Byzantine devices by utilizing the concepts of verifiable secret sharing [20] and Lagrange interpolation. Each device i divides the secret

g_{i}

into N committed shares locally and transmits each

{[[g_{i}]]}_{T}^{j}

to all other devices j (

j \in [1, N]

). Collaboration among devices enables secure computation of addition, multiplication by a constant, and multiplication. Addition and multiplication by a constant can be performed easily by each device through direct local operations. For the multiplication of two secrets, complex multiplication is transformed into local addition operations using multiplication triples, thereby reducing online overhead. Further elaboration on this protocol is provided below.

1.: Secret Sharing: Each participant i possesses a secret $g_{i}$ , allowing them to construct a polynomial

$f_{i} (θ) = g_{i} + \sum_{j = 1}^{T} r_{i j} θ^{j},$

(3)

where the coefficients ${r_{i j}}_{j \in [1, N]}$ are selected at random within the field $F_{p}$ , with the constructed polynomial maintaining a maximum degree of $T - 1$ . In particular, the polynomial’s degree remains within $T - 1$ while its coefficients are randomly sampled from the finite field. Through the evaluation of $f_{i} (θ)$ across multiple distinct points, participating entities can obtain

${[[g_{i}]]}_{T} = (f_{i} (1), \dots, f_{i} (j), \dots, f_{i} (N)) .$

(4)

Subsequently, participant i allocates the share ${[[g_{i}]]}_{T}^{j}$ across all other devices j. To maintain the integrity of these distributed shares, participant i also broadcasts verifiable commitments regarding the polynomial coefficients of $f_{i}$ , specifically defined as

$c_{i j} = \{\begin{matrix} ψ^{g_{i}} f o r j = 0 \\ ψ^{r_{i j}} f o r j = 1, \dots, T \end{matrix} .$

(5)

Here, $ψ$ denotes a generator within the finite field $F_{p}$ , with all arithmetic operations being executed under the modulus $λ$ . The prime $λ$ must be appropriately chosen such that $λ - 1$ is divisible by p, ensuring the required algebraic structure for implementation.
Upon receiving the commitments in (2), each device $j \in [1, N]$ can verify the secret share ${[[g_{i}]]}_{T}^{j} = f_{i} (j)$ by checking the equation:

$ψ^{{[[g_{i}]]}_{T}^{j}} = \prod_{k = 0}^{T} c_{i k}^{θ_{j}^{k}},$

(6)

where all mathematical operations within this framework are executed using modulo $λ$ calculations. The cryptographic commitment protocol guarantees the accurate generation of confidential shares derived from Equation (2)’s polynomial expression, consequently establishing proof of their authenticity through verification processes.
2.: Computation: Three types of calculations are permitted in the protocol, as follows:
Addition: Given two confidential parameters $g_{1}$ and $g_{2}$ , represented by their respective shares ${[[g_{1}]]}_{T}$ and ${[[g_{2}]]}_{T}$ , participating devices collectively perform computations on these shared values:

${[[g_{1}]]}_{T} + {[[g_{2}]]}_{T} = {[[g_{1} + g_{2}]]}_{T} .$

(7)

Multiply-by-constant: When provided with a constant $c \in Z_{p}$ and the shared secret ${[[g_{1}]]}_{T}$ corresponding to element $g_{1}$ , all participating devices perform the multiplication operation to scale the threshold-shared value by the given scalar within the prime field:

$c \cdot {[[g_{1}]]}_{T} = {[[c \cdot g_{1}]]}_{T} .$

(8)

Multiplication: For confidential parameters $g_{1}$ and $g_{2}$ , represented as shared values ${[[g_{1}]]}_{T}$ and ${[[g_{2}]]}_{T}$ , combined with precomputed randomized triples ${[[x]]}_{T}$ , ${[[y]]}_{T}$ , ${[[z]]}_{T}$ (satisfying $z = x \cdot y$ ), the process is initiated by calculating ${[[g_{1}]]}_{T} - {[[x]]}_{T} = {[[g_{1} - x]]}_{T}$ . Participants then employ a reconstruction protocol to publicly reveal masked values $e = g_{1} - x$ and $d = g_{2} - y$ , which maintain their statistical randomness. Through algebraic expansion, the product $g_{1} g_{2}$ equates to $z + e g_{2} + d g_{1} - e d$ . This formulation enables distributed computation, where each device independently calculates the combined result using the pre-shared triple components and revealed values.

${[[g_{1} g_{2}]]}_{T} = {[[z]]}_{T} + e {[[g_{2}]]}_{T} + d {[[g_{1}]]}_{T} - e d .$

(9)
3.: Reconstruction: During the secret reconstruction phase, each participant i acquires a secret share of $g$ , denoted as ${[[g]]}_{T}^{i}$ . These shares are subsequently transmitted to the smart contract by all devices, enabling the recovery of $g$ through Lagrange interpolation. More precisely, given a polynomial $f_{g} (x)$ of degree $T - 1$ , the reconstruction employs the Lagrange basis expression $f_{g} (x) = \sum_{i \in C} f_{g} (i) γ_{i} (x)$ . Here, $C \in Z_{p}$ satisfies $| C | = T$ , where $γ_{i} (x)$ represents a polynomial of degree $T - 1$ , defined as $γ_{i} (x) = \prod_{j \in C, j \neq i} \frac{x - j}{i - j}$ . The recombination vector may then be straightforwardly determined through the calculation of $r = {r_{i} | i \in C}$ , defined by $r_{i} = γ_{i} (0)$ . Subsequently, the secret $g$ is recovered as follows:

$g = \sum_{i \in C} r_{i} f_{g} (i) .$

(10)

3. Related Work

This study investigates critical obstacles present in existing FL frameworks, with a concentrated analysis of three distinct research domains.

3.1. Blockchain-Based Federated Learning

Conventional federated learning architectures predominantly rely on centralized servers, introducing vulnerabilities such as single points of failure and possible server misconduct. The integrity of federated learning ecosystems becomes jeopardized when the central servers experience security breaches [21]. Emerging blockchain technology has attracted significant interest, due to its decentralized nature and robust security features. Blockchain-enhanced federated learning (BFL) systems utilize distributed ledger technology to eliminate central server dependencies and associated risks. The BAFFLE framework [22] implements smart contracts (SCs) to orchestrate model storage and monitor participant statuses, executing both model updates and aggregation processes through automated contractual agreements. This approach enhances system resilience against centralized failures while ensuring equitable client participation. Parallel innovations such as BlockFL [23] leverage blockchain-based smart contracts for secure model update exchanges and validation processes, simultaneously resolving single-point vulnerability issues and incentivizing broader device participation through sample-size proportional rewards. Within medical applications, Swarm Learning integrates FL with blockchain technology to enable collaborative disease diagnosis models while preserving patient data confidentiality [10]. Existing implementations nevertheless face challenges, as smart contract-mediated model aggregation generates considerable computational demands and network congestion for blockchain nodes, while lacking mechanisms to detect adversarial gradient contributions. To mitigate these limitations, researchers have proposed committee-consensus BFL frameworks [24] that streamline computational requirements and enhance security against malicious actors, although the determination of optimal committee formation criteria remains an open research question.

3.2. Privacy-Preserving Federated Learning

While existing research has explored various strategies for protecting privacy in federated learning and blockchain systems, the proposed solutions can be primarily grouped into three distinct approaches: 1. Differential Privacy (DP); 2. Homomorphic Encryption (HE); and 3. Secure Multi-Party Computation (MPC).

1.: Differential Privacy (DP): DP mechanisms safeguard blockchain-powered federated learning (BFL) systems through randomized data modification during information exchange. Learning Chain [25] implements such a mechanism by distorting local gradients through probabilistic noise injection based on exponential mechanisms prior to blockchain integration, disseminating these adjusted parameters throughout the BCFL framework. Blade-FL [26] adopts a dual-role architecture where participants simultaneously engage in computational validation and model refinement via decentralized peer-to-peer training, incorporating Gaussian-distributed perturbations during gradient preparation before cryptographic encapsulation. Biscotti [11] enhances BCFL security through a hybrid approach combining DP with multi-party computation techniques, effectively countering both adversarial manipulation and privacy inference attempts.
2.: Homomorphic Encryption (HE): Homomorphic encryption (HE) serves as a cryptographic method to enhance secure learning by allowing computations on cipher-texts without requiring prior decryption. For example, SL+HE [15] implements partial HE to encrypt swarm learning updates, thereby enhancing security against inference attacks in blockchain-based federated learning (BCFL). Similarly, additive HE can be applied in distributed learning systems to protect model updates and maintain gradient confidentiality, as detailed in [12]. Meanwhile, PBFL [27] leverages cosine similarity metrics to detect malicious gradients while employing fully HE for secure aggregation processes.
3.: Secure Multiparty Computation (SMC): This cryptographic technique safeguards the data of participants by producing randomized data points divergent from source information, which are then allocated across participating entities for decentralized processing. Within SMC frameworks, participant-held datasets remain indecipherable until aggregated computational outputs are synthesized through collaborative analysis. Illustrating this paradigm, the MPC-driven PPML framework [16] caters to single-server architectures, while SecureML [28] (alongside Securenn [29]) target distributed multi-server ecosystems requiring coordinated computation protocols.

3.3. Federated Learning Against Poisoning Attacks

Federated learning systems remain vulnerable to multiple poisoning attack variants, which researchers typically classify into two primary dimensions. From an objective perspective, attacks can be divided into non-targeted and targeted attacks. The former seeks to degrade model performance across all test data, while the latter selectively impairs recognition capabilities for specific inputs while maintaining normal prediction accuracy for other data. From another perspective (i.e., examining adversarial capabilities), attacks manifest as either data corruption or parameter manipulation. Malicious actors conducting data poisoning attacks tamper with local training datasets to indirectly influence parameter updates through contaminated samples. In contrast, model poisoning attacks involve direct manipulation of parameter updates on compromised devices through gradient alteration.

To address poisoning threats from malicious participants, various Byzantine-resilient aggregation techniques have been developed for federated learning systems. For example, Krum [17] counters these attacks by identifying gradient vectors with minimal Euclidean distance to their majority neighbors. Trim-mean [19] adopts a ranking approach, excluding extreme model updates before calculating the trimmed average of remaining gradients. RLR [30] offers an efficient defense against back door attempts by dynamically adjusting the server’s learning rate based on sign pattern analysis of client contributions. FLTrust [31] utilizes a curated reference set to generate baseline model updates, effectively neutralizing malicious inputs without relying on client majority assumptions. PEFL [32] achieves dual protection against data poisoning and privacy breaches through homomorphic encryption-based malicious behavior identification in encrypted gradient space. While traditional FL security solutions focus on centralized architectures, blockchain-integrated federated learning (BCFL) enhances security during aggregation through the use of consensus-aligned update validation mechanisms [33].

4. Problem Formulation

In this section, we formalize the problem definition and design goals.

4.1. Problem Definition

In this study, we consider a typical IoMT setting consisting of two types of participants—namely, edge nodes and IoMT devices—which cooperate to achieve the FL training task. The knowledge and capabilities of k malicious IoMT devices in a total of N devices are defined as follows:

Malicious devices can hold their own toxic data, but cannot access the local data of other honest IoMT devices.
Malicious devices can obtain the global model. However, local model updates uploaded by a single honest devices cannot be observed.
Malicious devices can collude and share a common goal to amplify the impact of their malicious attacks.
Malicious devices can launch either targeted or non-targeted attacks.

Based on the assumptions regarding the knowledge and behavior of the malicious devices, it is obvious that malicious IoMT devices can direct the global model in the wrong direction. Due to the poisoning attack of one or more malicious IoMT devices, the accuracy and reliability of the global model may be greatly reduced, thereby leading to the global model’s distrust of honest IoMT devices.

4.2. Design Goals

Our primary aim is to design a blockchain-driven federated learning system (PP-BFL) that integrates data privacy safeguards and robust defense mechanisms against adversarial interference. This architecture seeks to mitigate data poisoning threats while simultaneously optimizing computational efficiency and ensuring model integrity. Key objectives encompass: developing attack-resistant algorithms, implementing lightweight cryptographic protocols, and establishing decentralized verification processes that maintain confidentiality while reducing resource consumption.

Privacy: The primary goal involves securing IoMT device data against breaches while maintaining the privacy of their gradient information. This framework prevents unauthorized entities—whether malicious devices or external actors—from gaining entry to or deducing sensitive details contained within these gradients.
Robustness: The proposed framework should be robust against malicious attacks, which means that the accuracy of the global model should not be affected by malicious IoMT devices.
Efficiency: The proposed framework must prioritize operational efficiency by minimizing both computational demands and data transmission costs while maintaining system performance.
Accuracy: The proposed framework maintains high accuracy levels while ensuring data confidentiality and mitigating adversarial data manipulation. This balance is achieved through cryptographic privacy preservation techniques combined with anomaly detection mechanisms that identify and neutralize malicious input patterns without degrading model performance.
Reliability: All operations must be comprehensively documented to safeguard against potential denial attempts by malicious devices, ensuring accountability throughout system interactions.

5. Proposed Approach and System

This section presents our proposed privacy-preserving, poisoning-defending, blockchain-based federated learning (PPBFL) scheme. First, we present an overview of the system architecture. Next, we discuss the various components of our proposed framework in detail. Table 1 systematically organizes key terms and symbolic representations for reference.

5.1. System Architecture

Figure 4 depicts our proposed framework, which consists of four primary components: 1. Task Publisher; 2. Trusted Authority; 3. IoMT Edge Nodes; and 4. IoMT Devices, supported by 5. blockchain Infrastructure. The roles and interactions of these components are elaborated below.

1.: Task Publisher: To address IoMT application needs, the task publisher (TP) designs the FL model training task, implements it through a blockchain smart contract (SC), and pays a fee as a bonus pool. IoMT devices that meet the requirements can apply to participate in the task. Then, the TP will initialize the model parameters and upload them to the blockchain by way of transactions.
2.: Trusted Authority: The trusted authority (TA) initializes the system by generating and distributing public/private key pairs for IoMT edge nodes.
3.: IoMT Edge Nodes: Edge nodes, functioning as blockchain clients, possess public/private key pairs $(p k_{m}, s k_{m})$ issued by the Trusted Authority (TA). Their responsibilities encompass: (a) registering with the Smart Contract (SC), (b) downloading the global model from the blockchain, (c) collecting secret shares of all gradients and computing partial similarities between these secret shares, and (d) partially aggregating the secret shares of gradients from the selected models.
4.: IoMT Devices: Devices are responsible for: (a) locally training the model, quantizing, and securely sharing gradients each round; (b) broadcasting gradient secret shares to all edge nodes; and (c) downloading the global model from an edge node to update local model parameters.
5.: Blockchain: In the Internet of Medical Things (IoMT), blockchain technology substitutes the traditional federated learning (FL) parameter server. Its responsibilities are: (a) verifying the public keys of registered nodes; (b) gathering secret shares of similarities from these nodes and reconstructing the similarities between local model updates; (c) collecting secret shares of aggregated gradients from all nodes and reconstructing the aggregated gradients.

5.2. Threat Model

Within our security framework, we acknowledge potential scenarios in which medical IoT devices could engage in adversarial behaviors by transmitting manipulated parameter updates to undermine the integrity of the aggregated model. Our analysis distinguishes between two critical vulnerabilities—namely, data exposure risks and model corruption attempts—which are thoroughly examined in subsequent sections.

1.: Privacy leakage: In federated learning processes, compromised IoMT equipment may be able to deduce confidential data from legitimate participants. These adversarial actors could access gradient data submitted by compliant devices through blockchain records to reverse-engineer sensitive parameters. Contrary to protocol-following passive devices, maliciously active participants might alter intermediate computations within secure multiparty protocols. Furthermore, coordinated groups of IoMT devices could cooperate maliciously, amplifying their capacity to compromise the confidentiality of data.
2.: Poisoning attacks: In the training phase, external adversaries might exploit compromised IoMT devices to initiate poisoning attacks. Attackers could potentially intercept sensitive details regarding localized training datasets and parameter adjustments from hijacked nodes. Simultaneously, they might intentionally distort the devices’ optimization gradients to impede the collaborative learning model’s stabilization process and degrade its predictive performance.

We outline the workflow of the proposed protocol in Algorithm 1.

Algorithm 1: Privacy-Preserving Poisoning-Defending BCFL

5.3. Initialization

The blockchain-based federated learning (FL) system is established by initializing system parameters and publishing public parameters on the blockchain. Each device must register as a legitimate participant before engaging in training. The specific processes are executed as follows:

1.: Task Publish: During initialization, system participants (e.g., IoMT devices and edge nodes) establish connections to create a peer-to-peer network. A TP initiates a FL task by implementing an SC on the blockchain network, and pays a fee as a bonus pool. Within the deployed SC, the TP sets the initial public parameter $i p p = (w^{(0)}, δ, b)$ , where $w^{(0)}$ , $δ$ , and b represent the initial weights, learning rate, and training batch size, respectively. Subsequently, other authenticated devices can synchronize with it.
2.: System Initialization: The Trusted Authority (TA) initially selects a security parameter $α$ and two large safe prime numbers, p and q, where $| p | = | q | = α$ . Subsequently, the TA computes $M = p q$ and $λ = l c m (p - 1, q - 1)$ . A generator h of order $(p - 1) (q - 1) / 2$ is then chosen by TA; for example, $h = - a^{2 M}$ , where a is a random number in $Z_{M^{2}}^{*}$ . Following this, the TA picks N unique elements ${θ_{i}}_{i \in [1, N]}$ from $F_{p}$ . Each device $i (1 \leq i \leq N)$ requests the TA for $(h, p, q)$ and N distinct elements ${θ_{i}}_{i \in [1, N]}$ .
3.: Edge Node Register: Let there be m nodes in the system. Each node i $(1 \leq i \leq M)$ randomly chooses a private key $s k_{i} \in Z_{q}^{*}$ and calculates their public key as $p k_{i} = h^{s k_{i}}$ . In order to demonstrate the validity of $s k_{i}$ , a common non-interactive zero-knowledge (NIZK) proof, $p r o o f_{s k_{i}}$ , is generated [34]. The NIZK proving and verification processes are represented by $p r o v e (s k_{i}, p k_{i})$ and $v e r i f y (p k_{i}, p r o o f_{s k_{i}})$ , respectively, as outlined in Algorithms 2 and 3. Subsequently, each node publishes their public key $p k_{i}$ along with the corresponding $p r o o f_{s k_{i}}$ on the blockchain.

Algorithm 2: NIZK Prove Function

Input:: $s k_{i}$ , $p k_{i}$
Output:: $p r o o f_{s k_{i}}$
1:: function prove( $s k_{i}$ , $p k_{i}$ )
2:: $r_{i} \in_{R} Z_{p}^{*}$
3:: $t_{i} \leftarrow h^{r_{i}} m o d p$
4:: $c_{i} \leftarrow H (h | | p k_{i} | | t_{i})$
5:: $s_{i} \leftarrow r_{i} - c_{i} \cdot x_{i}$
6:: $p r o o f_{s k_{i}} = < t_{i}, c_{i}, s_{i} >$
7:: return $p r o o f_{s k_{i}}$
8:: end function

Algorithm 3: NIZK Verify Function

5.4. Local Computation

Local computation comprises model updating, local training, normalization, and secret sharing. The processes of LocalComputation are outlined in Algorithm 1. Below, we elaborate on these components.

1.: Model update: Device $C_{i}$ retrieves the most recent global model $w^{(t - 1)}$ from an edge node. Subsequently, the local model $w_{i}^{(t)}$ is updated using Equation (7), where $δ$ represents the local learning rate. If the devices’ objective function converges, the training process concludes; otherwise, $C_{i}$ proceeds to the next iteration.

$w_{i}^{(t)} = w^{(t - 1)} - δ g_{i}^{(t - 1)} .$

(11)

The specifics of updating the local models are elucidated by invoking ModelUpdate, as defined in Algorithm 4.

Algorithm 4: ModelUpdate

2.: Local training: During the $t^{th}$ iteration of training, each device $C_{i}$ involved in FL leverages its local dataset $D_{i}$ ( $i \in [1, N]$ ) and local model $w_{i}^{(t)}$ to compute the local gradient $g_{i}^{(t)}$ , as defined in Equation (12).

$g_{i}^{(t)} = \nabla L (w_{i}^{(t)}, D_{i}),$

(12)

where $L (w_{i}^{(t)}, D_{i})$ denotes the empirical loss function, with ∇ representing the derivative operation.
3.: Normalization and quantization: For device $C_{i}$ , we employ Equation (13) to normalize the local gradients as follows:

${\tilde{g}}_{i}^{(t)} = \frac{g_{i}^{(t)}}{| | g_{i}^{(t)} | |},$

(13)

where ${\tilde{g}}_{i}^{(t)}$ represents the unit vector. For any integer $q \geq 1$ , we introduce a stochastic rounding function, defined as:

$Q_{q} (x) = \{\begin{matrix} \frac{⌊q x⌋}{q} w i t h p r o b . 1 - (q x - ⌊q x⌋) \\ \frac{⌊q x⌋ + 1}{q} w i t h p r o b . q x - ⌊q x⌋ \end{matrix},$

(14)

where $⌊q x⌋$ denotes the largest integer less than or equal to x. It is important to note that this function is unbiased; that is, $E_{Q} [Q_{q} (x)] = x$ . The parameter q serves as a tuning parameter corresponding to the quantization level. The variance of $Q_{q} (x)$ diminishes with increasing q. Subsequently, we define the quantized gradient as:

${\bar{g}}_{i}^{(t)} : = ϕ (q \cdot Q_{q} ({\tilde{g}}_{i}^{(t)})),$

(15)

where the function $Q_{q}$ from Equation (9) is applied element-wise and a mapping function $ϕ : R \to F_{p}$ is specified to represent a negative integer in the finite field using the two’s complement representation.

$ϕ (x) = \{\begin{matrix} x i f x \geq 0 \\ p - x i f x < 0 \end{matrix} .$

(16)
4.: Secret sharing: Each device $C_{i}$ within the range of $[1, N]$ produces secret shares of the quantized gradient ${\bar{g}}_{i}^{(t)}$ by creating a random polynomial $f_{i} : F_{p} \to F_{p}^{d}$ of degree T:

$f_{i} (θ) = {\bar{g}}_{i}^{(t)} + \sum_{m = 1}^{T} r_{i m} θ^{m},$

(17)

where the vectors $r_{i m}$ are randomly generated from $F_{p}^{d}$ by device i. Subsequently, device i transmits the secret shares:

${[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m} = f_{i} (θ_{m})$

(18)

to edge node m within the range of $[1, M]$ . In order to ensure the verifiability of these shares, device i also publicly discloses the commitments ${c_{i m}}_{m \in [1, M]}$ to the coefficients of $f_{i}$ to all edge nodes. These commitments are defined as:

$c_{i m} = \{\begin{matrix} ψ^{{\bar{g}}_{i}^{(t)}} f o r m = 0 \\ ψ^{r_{i m}} f o r m = 1, \dots, T \end{matrix},$

(19)

where $ψ$ represents a generator of $F_{p}$ , and all computations are performed modulo $λ$ , with $λ$ defined as a large prime such that p divides $λ - 1$ .

5.5. Secure Similarity Computation

Algorithm 1 outlines the similarity computation processes as follows:

1.: Partial similarity computation: In our privacy-preserving similarity computation method, each edge node calculates partial similarities using the secret shares of gradients provided by all devices. Specifically, edge node m computes the partial similarities as

${[[{\bar{c s}}_{i}^{(t)}]]}_{T}^{m} = {[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m} ⊙ \sum_{i = 1}^{T} {[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m},$

(20)

where ${[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m}$ represents the normalized gradient of each device $C_{i}$ , and $\sum_{i = 1}^{T} {[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m}$ denotes the sum of all gradients.
2.: Publication: Each edge node m publicly discloses the partial similarities ${{[[{\bar{c s}}_{i}^{(t)}]]}_{T}^{m}}_{i \in [1, N]}$ on the blockchain.
3.: Similarity reconstruction: Upon receiving adequate partial similarities from multiple edge nodes, the smart contract is activated to reconstruct the quantized similarities ${\bar{c s}}_{i}^{(t)}$ using Lagrange interpretation. Subsequently, the smart contract converts ${\bar{c s}}_{i}^{(t)}$ from the finite field to the real domain.

$c s_{i}^{(t)} = \frac{ϕ^{- 1} ({\bar{c s}}_{i}^{(t)})}{q^{2}}$

(21)

for $i \in [1, N]$ , where q is the integer parameter in Equation (17) and the de-mapping function $ϕ^{- 1} : F_{p} \to R$ is defined as follows:

$ϕ^{- 1} (\bar{x}) = \{\begin{matrix} \bar{x} i f 0 \leq \bar{x} \leq \frac{p - 1}{2} \\ \bar{x} - p i f \frac{p - 1}{2} \leq \bar{x} \leq p \end{matrix} .$

(22)

The variable $c s_{i}^{(t)}$ quantifies the alignment between the normalized gradient ${\tilde{g}}_{i}^{(t)}$ of each device i and the total gradient sum $\sum_{i = 1}^{N} {\tilde{g}}_{i}^{(t)}$ . A negative value of $c s_{i}^{(t)}$ indicates an opposite direction between ${\tilde{g}}_{i}^{(t)}$ and $\sum_{i = 1}^{N} {\tilde{g}}_{i}^{(t)}$ , which can detrimentally impact the global model. To filter out adversarial gradients during aggregation, the rectified linear unit (ReLU) function is employed:

$R e L U (x) = \{\begin{matrix} x i f x > 0 \\ 0 i f x \leq 0 \end{matrix} .$

(23)

This allows us to compute the score $S_{i}^{(t)} = R e L U (c s_{i}^{(t)})$ for each device j. Subsequently, the smart contract broadcasts ${S_{i}^{(t)} = R e L U (c s_{i}^{(t)})}_{i \in [1, N]}$ to all edge nodes.

5.6. Secure Model Aggregation

1.: Local gradient aggregation: Each edge node m locally aggregates the secret shares, weighted by their corresponding scores.

${[[{\bar{g}}^{(t)}]]}_{T}^{m} = \frac{1}{s u m} \sum_{i = 0}^{N} S_{i}^{t} \cdot {[[{\bar{g}}_{i}^{(t)}]]}_{T}^{m},$

(24)

where $s u m = \sum_{i = 1}^{N} S_{i}^{t}$ . The entity then publishes ${[[{\bar{g}}^{t}]]}_{T}^{i}$ to the blockchain for public verification.
2.: Reconstruction of the global gradients: Upon receiving computation results from a sufficient number of devices, the smart contract is activated to reconstruct the quantized global gradients ${\bar{g}}^{(t)}$ through Lagrange interpretation. Subsequently, the smart contract converts ${\bar{g}}^{(t)}$ from the finite field to the real domain as follows:

$g^{(t)} = \frac{ϕ^{- 1} ({\bar{g}}^{(t)})}{q^{2}} .$

(25)

6. Security Analysis

As outlined in Section 3.2, our threat model encompasses two types of security threats: privacy leakage and poisoning attacks. In this section, we concentrate solely on privacy protection, specifically ensuring the confidentiality of IoMT device gradients originating from edge nodes and blockchain systems. Indeed, the security of our PPBFL is based on our actively secure evaluation protocol, which not only ensures privacy protection in an information-theoretic sense, but also resists malicious attacks carried out by the participants (i.e., it provides active security). First, each IoMT device uploads only committed shares of its local gradients to edge nodes throughout the entire training process. Based on the security provided by verifiable Shamir’s secret sharing, even if at most

T - 1

nodes collude with one another, neither the edge nodes nor blockchain can infer any useful information about the devices from these committed shares or forge them. All operations conducted on edge nodes and within the blockchain are executed under a secure multiparty computation (MPC) framework, thus ensuring the confidentiality of device gradients. Next, we present a proof for our scheme using a hybrid approach [35].

Theorem 1.

Assuming there exists a set of malicious edge nodes, denoted as

E_{A t t} \subset {E_{1}, \dots, E_{M}}

holding

| E_{A t t} | < T < M / 3 + 1

, it is always possible to identify a simulator

SIM

with adequate computational power such that, given parameters p, T, and M, indistinguishability between simulator and real protocol can be assured in an information-theoretic sense—even when edge nodes belonging to

E_{A t t}

collaborate with one another.

{SIM}_{E_{A t t}}^{p, T, M} \equiv {REAL}_{E_{A t t}}^{p, T, M} .

(26)

Proof.

Considering REAL’s input of

G_{N} = (g_{1}, \dots, g_{n}, \dots, g_{N})

and SIM’s input of

G_{N}^{'} = (g_{1}^{'}, \dots, g_{n}^{'}, \dots, g_{N}^{'})

with identical distributions, we demonstrate the indistinguishability between

{REAL}_{E_{A t t}}^{p, T, M}

and

{SIM}_{E_{A t t}}^{p, T, M}

as well as the unforgeability against active edge nodes throughout the entire execution process of our scheme, as follows.

${STEP}_{1}$: In this step, each edge node $P_{m}$ receives the committed shares of device gradients. Each node $P_{m}$ in the real protocol takes the shares denoted by ${{[{\bar{g}}_{n}]]}_{T}^{m}}_{n \in [N]}$ , while each node $P_{m}$ in the simulator utilizes ${[[{\bar{g^{'}}}_{n}]]}_{T}^{m}$ . Consequently, indistinguishability between ${REAL}_{E_{A t t}}^{p, T, M}$ and ${SIM}_{E_{A t t}}^{p, T, M}$ is assured.
${STEP}_{2}$: In this step, each edge node $P_{m}$ computes ${{[[{\bar{c s}}_{n}^{(t)}]]}_{T}^{m}}_{n \in [N]}$ . Each node $P_{m}$ in the real protocol calculates ${{[[{\bar{c s}}_{n}^{(t)}]]}_{T}^{m}}_{n \in [N]}$ while simultaneously calculating ${{[[{\bar{c s^{'}}}_{n}^{(t)}]]}_{T}^{m}}_{n \in [N]}$ in the simulator. Given that ${{[[{\bar{c s}}_{n}^{(t)}]]}_{T}^{m}}_{n \in [N]}$ and ${{[[{\bar{c s^{'}}}_{n}^{(t)}]]}_{T}^{m}}_{n \in [N]}$ share an identical distribution due to properties inherent to Shamir’s secret sharing and Lagrange interpolation techniques, we can guarantee their indistinguishability.
${STEP}_{3}$: In this step, each edge node $P_{m}$ transmits the ${[[{\bar{c s}}_{i}^{(t)}]]}_{T}^{m}$ to the smart contract for reconstruction of $c s_{i}^{(t)}$ . In the real protocol, each node $P_{m}$ sends ${[[{\bar{c s}}_{i}^{(t)}]]}_{T}^{m}$ to the smart contract, while in the simulator it sends ${[[{\bar{c s^{'}}}_{i}^{(t)}]]}_{T}^{m}$ to reconstruct ${c s^{'}}_{i}^{(t)}$ . Leveraging the properties of Lagrange interpolation, it can be ensured that only the final reconstructed result is disclosed. Consequently, indistinguishability between ${REAL}_{E_{A t t}}^{p, T, M}$ and ${SIM}_{E_{A t t}}^{p, T, M}$ is guaranteed.
${STEP}_{4}$: In this step, each edge node $P_{m}$ computes ${[[\bar{g}]]}_{T}^{m}$ . Each node performs this calculation as follows: in the real protocol, it calculates ${[[\bar{g}]]}_{T}^{m}$ while, in the simulator, it computes ${[[\bar{g^{'}}]]}_{T}^{m}$ . Given that both ${[[\bar{g}]]}_{T}^{m}$ and ${[[\bar{g^{'}}]]}_{T}^{m}$ share identical distributions due to Shamir’s secret sharing and Lagrange interpolation properties, their indistinguishability can be assured.
${STEP}_{5}$: In this step, each edge node $P_{m}$ submits ${[[\bar{g}]]}_{T}^{m}$ to the smart contract for the reconstruction of $g$ . Specifically, in the real protocol context, every node $P_{m}$ sends its respective ${[[\bar{g}]]}_{T}^{m}$ to facilitate reconstruction of $g$ while, during simulation, they send ${[[\bar{g^{'}}]]}_{T}^{m}$ instead for reconstructing $g^{'}$ . Based on the principles of Lagrange interpolation, only information pertaining to the final reconstructed results may be revealed. Thus, we ensure indistinguishability between ${REAL}_{E_{A t t}}^{p, T, M}$ and ${SIM}_{E_{A t t}}^{p, T, M}$ .

□

7. Experimental Setup

We implemented our scheme on a private Ethereum blockchain setup. The smart contract (SC) layer was developed using the Solidity programming language and deployed on the private blockchain utilizing Truffle. Our experiments were conducted using PyTorch version 2.6.0 running on a workstation equipped with Ubuntu 20.04 OS, an AMD Ryzen Threadripper 3970X (32 cores, 64 threads, 3.7 GHz), an RTX 3090 Ti GPU, and 256 GB of RAM. Simultaneously, we simulated edge nodes using independent threads, with each thread implementing a real-world PyTorch classifier. Additionally, mobile phones (Huawei nova 7, 2.6 GHz, 8 cores, and 8 GB RAM) were employed as model Internet of Medical Things (IoMT) devices. Furthermore, our scheme operated within a finite field

Z_{p}

, with the secure framework based on a

(T, N)

-threshold multi-party computation (MPC)-based protocol. The protocol utilized threshold multi-party computation (MPC), with the parameter p configured as

2^{26} - 5

to avoid data overflow. Standard threshold parameters were predefined as

T = 4

and

N = 10

in this framework.

7.1. Datasets and Settings

To evaluate the performance of our privacy-preserving blockchain federated learning (PP-BFL), we conducted experiments on two widely used datasets: PathMNIST [36] and OCTMNIST [37]. Details regarding these datasets are presented in Table 2. The PathMNIST dataset comprises

80, 000

non-overlapping image patches from hematoxylin and eosin-stained histological images, including a test dataset of

10, 000

image patches from a different clinical center. The dataset is comprised of 9 types of tissues, and each image has dimensions of

3 \times 28 \times 28

pixels. The OCTMNIST dataset comprises

90, 000

valid optical coherence tomography (OCT) images for retinal diseases. The dataset is comprised of four diagnosis categories, and each image is resized to

1 \times 28 \times 28

pixels.

7.2. Poisoning Attacks

In this experiment, we considered both targeted and non-targeted attacks. For non-targeted attacks, our analysis presumed that adversarial devices transmit randomized parameter updates, designed to undermine the global model’s reliability. Conversely, for targeted attacks—specifically assessing two prominent variants: class-redirection and covert-channel—we emulated class-redirection tactics by reassigning the original training labels and produced covert-channel instances through the deliberate insertion of activation patterns into unprocessed training data. Expanded methodological particulars are presented in the subsequent sections.

Label-flipping: We conducted a label-flipping attack on the PathMNIST dataset, where compromised devices relabeled the training dataset’s labels from 2 to 4 to simulate this type of attack.
Backdoor: To construct backdoor examples, we reassigned labels for DRUSEN exhibiting white horizontal stripes as diabetic macular edema (DME) on the OCTMNIST dataset.

7.3. Evaluation Metrics

We utilized test accuracy and test error rate as indicators for models trained on various datasets. The primary objective of our approach was to enhance the accuracy of the global model. Swarm Learning (SL) [10] serves as a widely recognized method in non-adversarial settings within Blockchain Federated Learning (BFL). Consequently, we evaluated the SL scheme without attacks as a baseline comparison against Krum [17], while also presenting results from our proposed scheme under varying numbers of malicious devices.

7.4. FL Settings

In our evaluation setup, we configured the number of devices at

n = 10

, selecting all devices during each iteration throughout training. Our training model was based on a Convolutional Neural Network (CNN) with the following architecture:

I n p u t \to C o n v \to M a x p o o l \to F u l l y C o n n e c t e d \to O u t p u t

. The weight and bias parameters for each layer were defined as follows: for the Conv layer,

w 1 = (10, 1, 3, 3)

and

b 1 = (10, 1)

; Fully Connected layer,

w 2 = (1960, 128)

and

b 2 = (1, 128)

; and the Output layer,

w 3 = (128, 10)

and

b 3 = (1, 10)

. Additional training parameters are summarized in Table 3.

Data allocation per device was performed equally, with each device containing 8000 uniformly distributed data points from the PathMNIST dataset and 9000 from the OCTMNIST dataset. We examined different scenarios regarding the proportion of malicious devices ranging from

10 %

to

50 %

. Our scheme was compared against both the SL and Krum methodologies. The batch size was set at

b = 128

. Furthermore, we observed that the loss function generally converged after approximately 50 rounds; thus, we established the number of training epochs at

e = 50

.

8. Experimental Results

In this section, based on extensive experimental results, we analyze the performance of our scheme from two perspectives: (1) Defensive effectiveness against both targeted and non-target attacks; and (2) computational and communication overheads.

8.1. Defense Effectiveness

To assess the defensive effectiveness of our PPBFL, we conducted simulations of both non-targeted and targeted attacks on the PathMNIST and OCTMNIST datasets. The experiments were analyzed from the perspective of varying attack proportions and different iterations. Additionally, we provide an accuracy comparison with the baseline scheme (SL) [10], as well as state-of-the-art methods such as Krum [17]. Figure 5 and Figure 6 illustrate the training processes over 50 iterations for the PathMNIST and OCTMNIST datasets, respectively. Specifically, despite varying numbers of attackers, our PP-BFL maintained an accuracy rate consistent with that of the baseline under both non-targeted and targeted attacks, thereby achieving robustness alongside high accuracy.

8.2. Overhead of Our Scheme

The computational efficiency of PPBFL is closely tied to its operational performance. We evaluated its resource consumption by measuring computational and communication demands, tracking how these metrics evolve as more devices join the network. Figure 7 demonstrates substantially reduced processing requirements on devices compared to edge servers—a difference stemming from their distinct operational roles. While devices primarily distribute gradient shares, edge nodes shoulder the majority of secure computational tasks. Communication patterns reveal similar disparities, as illustrated in Figure 8. Devices maintain leaner communication channels, mainly transmitting gradient shares to edge nodes and receiving aggregated weight updates. Conversely, edge infrastructure manages denser data exchanges to facilitate effective similarity calculations, requiring multiple coordination steps between network components.

In parallel, we conducted comparative analyses between our PPBFL framework and the MPC-based PPML approach [16], evaluating computational and communication costs across device and edge node operations. The experimental results presented in Figure 7 reveal that PPBFL demonstrated superior performance in terms of computational efficiency, which was particularly noticeable on devices. Notably, our solution maintained stable computational demand regardless of growth in the quantity of devices, while PPML exhibited a quadratic growth pattern in resource consumption with an increasing number, as can be seen from Figure 7a. Corresponding server-side analyses, as shown in Figure 7b, confirmed analogous efficiency advantages. This performance gap stems from PPBFL’s optimized computational architecture, compared to PPML’s more complex operational requirements. The communication metrics illustrated in Figure 8 further highlight PPBFL’s technical superiority, achieving reduced data transmission requirements through minimized interaction rounds between devices and edge nodes. Unlike PPML’s intensive communication protocols requiring multi-party coordination and cloud server mediation, our framework implements streamlined data exchange mechanisms. Additionally, PPML’s architecture demands supplementary communication resources to handle device disconnections, whereas PPBFL inherently maintains operational stability without requiring such compensatory measures.

Operations executed through the smart contract were processed as Ethereum transactions, prompting us to quantify computational expenses through analyzing the gas units consumed during EVM instruction execution. Therefore, on-chain activities were evaluated through systematic gas utilization assessments, which aligned with transaction processing requirements. Under EIP-1559 specifications, the maximum block gas capacity has been raised from 15 million units to 30 million, although standard operational targets are typically maintained at approximately 15 million units. Moreover, the gas consumption caused by the transactions mined in one block cannot exceed the block gas limit. To avoid reaching the gas limit, we split the submit() and aggregate() functions up and imposed restrictions on the number of gradients published in one transaction to 200. As shown in Table 4, we measured each operation’s computational consumption in gas.

The deployment of the smart contract necessitates an expenditure of

3, 602, 173

gas units. The framework initially configures authorized addresses within the permissioned blockchain network to restrict unauthorized participation, requiring

208, 906

gas units for this setup phase. Prospective participants must complete blockchain registration by submitting cryptographic public keys, accompanied by Non-Interactive Zero-Knowledge (NIZK) validation credentials, with the blockchain’s verification and storage of these keys demanding

157, 970

gas per registration instance. The subsequent phase involves devices transmitting gradient data to the blockchain network, with transaction processing capacities handling batches of up to 200 gradients at a gas expenditure of

17, 224, 780

per computational transaction. Following complete cipher-text submissions from all participants, the blockchain executes aggregation operations to derive global gradient parameters, with each aggregation transaction consuming

26, 843, 542

gas units.

9. Discussion

In this section, we first assess the functionality of our proposed scheme by comparing it with several related works. Subsequently, we summarize the performance of our PPBFL.

9.1. Functionality

First, the functionality of our PPBFL was evaluated in comparison to several state-of-the-art approaches, including SL [10], BFLC [24], Krum [17], FLTrust [31], Bift [33], LearningChain [25], SL+HE [15], PPML [16], and Securenn [29]. As illustrated in Table 5, SL [10] replaces the traditional server with a blockchain framework to mitigate the risks associated with single points of failure (SPFs), as well as potential malicious behaviors exhibited by servers.

Note that the security level includes passive security and active security; here, passive security refers to ensuring the security of the devices when the attackers are honest-but-curious (i.e., the attacker only performs passive attacks such as inference attacks), while active security refers to ensuring the security of devices when the attackers are malicious-and-curious (i.e., the attacker not only carries out passive attacks, but also actively performs malicious behaviors such as poisoning attacks). The DP-based LearningChain [25], HE-based SL+HE [15], and MPC-based schemes [16,29] consistently safeguard device privacy through their secure frameworks. Obviously, the above schemes can be categorized as passive security, as they do not consider of defending against poisoning attacks. Meanwhile, the Krum [17], FLTrust [31], and Bift [33] frameworks are capable of defending against poisoning attacks but cannot protect device privacy. Therefore, these schemes cannot be categorized as either passive or active security. In contrast to all of the above-mentioned schemes, our PPBFL employs our MPC-based actively secure evaluation protocol to achieve active security, not only protecting the privacy of devices but also preventing poisoning attacks.

Furthermore, as our actively secure evaluation protocol utilizes

(T, N)

-threshold MPC to construct our secure framework, our PPBFL can resist collusion between at most

T - 1

edge nodes, as well as supporting edge node dropout up to a level of

N - T

.

9.2. Performance

As discussed in Section 8, we conducted a series of experiments to evaluate the efficacy of our PPBFL. Based on the empirical results, the following conclusions can be drawn.

Privacy of Device Data: The PPBFL allows each IoMT device to collaboratively train the FL model locally with their local data. The PPBFL method is unlike the centralized machine learning approach, where the IoMT devices need to send their local data to the cloud for the learning process. Therefore, the proposed FL approach can ensure the privacy of these devices’ sensitive data.
Robustness of Local and Global Model: In our scenario, we consider adversaries that perform poisoning attacks on the IoMT device’s datasets. Such a poisoning attack will lead to a faulty local model and a poisoned global model. Our PPBFL utilizes the cosine similarity to detect and eliminate poisoned local model updates during the aggregation process. Judging from the experimental results, our PPBFL can effectively mitigate the influence of poisoned local model updates. From this result, our PPBFL can guarantee the robustness of both the local and global models.
Privacy of Local Model: In a traditional FL framework, attackers can perform a membership inference attack on the local device models, thus leaking sensitive data from the model. Therefore, we leverage the MPC-based secret sharing protocol, which maintains data confidentiality throughout FL processes, effectively addressing the inherent tension between data privacy preservation and robust defense against poisoning attacks. As the local gradient is protected, model inversion attacks and parameter stealing cannot be performed on the local model by an attacker.
MPC-based Secure Aggregation: In traditional FL approaches, the local models of devices are collected and aggregated into a global model. The aggregation process is the core step of FL to achieve a learning model with higher accuracy. As the aggregation is performed in the blockchain node using MPC-based actively secure evaluation protocol, adversaries cannot tamper with the aggregation process, thus maintaining the model’s accuracy.
Verifiability of the Global Model: As a decentralized technology, blockchain can maintain the integrity of data. In our PPBFL, we leverage blockchain to store the latest global model after the secure aggregation process. The decentralized process makes it impossible for adversaries to tamper with or alter the global model, as this will change the hash value. Later, the global model stored in the blockchain will be sent to the IoMT devices. Moreover, the devices can verify the integrity of the global model by checking the signatures and hashes before they use it.

10. Conclusions

In this study, we proposed a privacy-preserving blockchain-based federated learning (BFL) framework that protects against poisoning attacks, thus promoting safe data sharing in the IoMT context. In particular, the PPBFL was constructed to address the contradictory issues of privacy protection and poisoning defense in BFL. Based on the actively secure evaluation protocol and hierarchical framework, our PPBFL is characterized by its security, utility, robustness, and efficiency, effectively satisfying the demands of practical IoMT.

In the proposed framework, we provide a privacy-preserving training mechanism using a

(T, N)

-threshold MPC-based scheme, which ensures low computation and communication overheads. To resist poisoning attacks, we provide a robust global model by filtering out the harmful gradients using the cosine similarity. Our key insight is that the cosine similarity can be calculated homomorphically, based on our actively secure evaluation protocol, thus making it feasible to simultaneously ensure privacy protection and poisoning defense in FL. Furthermore, the blockchain is used to facilitate transparent processes and the enforcement of regulations, eliminating the need for a central coordinating authority during model training processes. Experiments were carried out on two medical datasets—PathMNIST [36] and OCTMNIST [37] —allowing for comparison of the proposed approach with Swarm Learning (SL) [10] and Krum [17]. The experimental results demonstrated that the proposed scheme can resist model poisoning attacks and achieve high global model accuracy.

In the future, we plan to develop an efficient consensus mechanism for PPBFL, in order to reduce computational and energy resources, as well as an incentive mechanism for PPBFL, in order to drive participants to actively and honestly take part in FL training tasks. Meanwhile, we plan to broaden the scope of PPBFL in the future in order to better support heterogeneous models in blockchain-based federated learning (BFL) for data sharing in IoMT.

Author Contributions

Conceptualization, X.Z.; Methodology, X.Z.; Software, X.Z.; Writing—original draft, X.Z.; Writing—review & editing, H.L.; Supervision, H.L.; Project administration, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: [https://medmnist.com/].

Conflicts of Interest

The authors declare no conflicts of interest.

References

Gatouillat, A.; Badr, Y.; Massot, B.; Sejdic, E. Internet of Medical Things: A review of recent contributions dealing with cyber-physical systems in medicine. IEEE Internet Things J. 2018, 5, 3810–3822. [Google Scholar] [CrossRef]
Philip, N.Y.; Rodrigues, J.J.; Wang, H.; Fong, S.J.; Chen, J. Internet of Things for in-home health monitoring systems: Current advances, challenges and future directions. IEEE J. Sel. Areas Commun. 2021, 39, 300–310. [Google Scholar] [CrossRef]
Li, T.; Sahu, A.K.; Talwalkar, A.; Smith, V. Federated learning: Challenges, methods, and future directions. IEEE Signal Process. Mag. 2020, 37, 50–60. [Google Scholar] [CrossRef]
Mothukuri, V.; Parizi, R.M.; Pouriyeh, S.; Huang, Y.; Dehghantanha, A.; Srivastava, G. A survey on security and privacy of federated learning. Future Gener. Comput. Syst. 2021, 115, 619–640. [Google Scholar] [CrossRef]
Nakamoto, S. Bitcoin: A Peer-to-Peer Electronic Cash System. 2008. Available online: https://bitcoin.org/bitcoin.pdf (accessed on 2 May 2025).
Liu, Y.; Yu, W.; Ai, Z.; Xu, G.; Zhao, L.; Tian, Z. A blockchain-empowered federated learning in healthcare-based cyber physical systems. IEEE Trans. Netw. Sci. Eng. 2022, 10, 2685–2696. [Google Scholar] [CrossRef]
Nguyen, D.C.; Ding, M.; Pham, Q.V.; Pathirana, P.N.; Le, L.B.; Seneviratne, A.; Li, J.; Niyato, D.; Poor, H.V. Federated learning meets blockchain in edge computing: Opportunities and challenges. IEEE Internet Things J. 2021, 8, 12806–12825. [Google Scholar] [CrossRef]
Hou, D.; Zhang, J.; Man, K.L.; Ma, J.; Peng, Z. A systematic literature review of blockchain-based federated learning: Architectures, applications and issues. In Proceedings of the 2021 2nd Information Communication Technologies Conference (ICTC), Nanjing, China, 7–9 May 2021; pp. 302–307. [Google Scholar]
Li, C.; Yuan, Y.; Wang, F.-Y. Blockchain-enabled federated learning: A survey. In Proceedings of the 2021 IEEE 1st International Conference on Digital Twins and Parallel Intelligence (DTPI), Beijing, China, 15 July–15 August 2021; pp. 286–289. [Google Scholar]
Warnat-Herresthal, S.; Schultze, H.; Shastry, K.L.; Manamohan, S.; Mukherjee, S.; Garg, V.; Sarveswara, R.; Händler, K.; Pickkers, P.; Aziz, N.A.; et al. Swarm learning for decentralized and confidential clinical machine learning. Nature 2021, 594, 265–270. [Google Scholar] [CrossRef]
Shayan, M.; Fung, C.; Yoon, C.J.M.; Beschastnikh, I. Biscotti: A blockchain system for private and secure federated learning. IEEE Trans. Parallel Distrib. Syst. 2021, 32, 1513–1525. [Google Scholar] [CrossRef]
Phong, L.T.; Aono, Y.; Hayashi, T.; Wang, L.; Moriai, S. Privacy-preserving deep learning via additively homomorphic encryption. IEEE Trans. Inf. Forensics Secur. 2018, 13, 1333–1345. [Google Scholar] [CrossRef]
Jayaraman, B.; Evans, D. Evaluating differentially private machine learning in practice. In Proceedings of the 28th USENIX Security Symposium (USENIX Security 19), Santa Clara, CA, USA, 14–16 August 2019; pp. 1895–1912. [Google Scholar]
Shamir, A. How to share a secret. Commun. ACM 1979, 22, 612–613. [Google Scholar] [CrossRef]
Chen, L.; Fu, S.; Lin, L.; Luo, Y.; Zhao, W. Privacy-preserving swarm learning based on homomorphic encryption. In Algorithms Architectures Parallel Process; Kluwer Academic/Plenum Publishers: Cham, Switzerland, 2022; pp. 509–523. [Google Scholar]
Bonawitz, K.; Ivanov, V.; Kreuter, B.; Marcedone, A.; McMahan, H.B.; Patel, S.; Ramage, D.; Segal, A.; Seth, K. Practical secure aggregation for privacy-preserving machine learning. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, Dallas, TX, USA, 30 October–3 November 2017; pp. 1175–1191. [Google Scholar]
Blanchard, P.; ElMhamdi, E.M.; Guerraoui, R.; Stainer, J. Machine learning with adversaries: Byzantine tolerant gradient descent. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 118–128. [Google Scholar]
Chen, Y.; Su, L.; Xu, J. Distributed statistical machine learning in adversarial settings: Byzantine gradient descent. In Proceedings of the ACM on Measurement and Analysis of Computing Systems, New York, NY, USA, 1 December 2017; pp. 1–25. [Google Scholar]
Yin, D.; Chen, Y.; Ramchandran, K.; Bartlett, P. Byzantine-robust distributed learning: Towards optimal statistical rates. In Proceedings of the Thirty-Fifth International Conference on Machine Learning, Stockholm, Sweden, 5 May 2018; pp. 5650–5659. [Google Scholar]
Feldman, P. A practical scheme for non-interactive verifiable secret sharing. In Proceedings of the 28th Annual Symposium on Foundations of Computer Science, Los Angeles, CA, USA, 12–14 October 1987; IEEE: Piscataway, NJ, USA, 1987; pp. 427–438. [Google Scholar]
Yang, Q. Toward responsible AI: An overview of federated learning for user-centered privacy-preserving computing. ACM Trans. Interactive Intell. Syst. 2021, 11, 32. [Google Scholar] [CrossRef]
Ramanan, P.; Nakayama, K. BAFFLE: Blockchain based aggregator free federated learning. In Proceedings of the 2020 IEEE International Conference on Blockchain (Blockchain), Rhodes, Greece, 2–6 November 2020; pp. 72–81. [Google Scholar]
Kim, H.; Park, J.; Bennis, M.; Kim, S.L. Blockchained on-device federated learning. IEEE Commun. Lett. 2020, 24, 1279–1283. [Google Scholar] [CrossRef]
Li, Y.; Chen, C.; Liu, N.; Huang, H.; Zheng, Z.; Yan, Q. A blockchain-based decentralized federated learning framework with committee consensus. IEEE Netw. 2021, 35, 234–241. [Google Scholar] [CrossRef]
Chen, X.; Luo, J.; Liao, C.W.; Li, P. When machine learning meets blockchain: A decentralized, privacy-preserving and secure design. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018; pp. 1178–1187. [Google Scholar]
Li, J.; Shao, Y.; Wei, K.; Ding, M.; Ma, C.; Shi, L.; Han, Z.; Poor, H.V. Blockchain assisted decentralized federated learning (BLADE-FL): Performance analysis and resource allocation. IEEE Trans. Parallel Distrib. Syst. 2022, 33, 2401–2415. [Google Scholar] [CrossRef]
Miao, Y.; Liu, Z.; Li, H.; Choo, K.K.R.; Deng, R.H. Privacy-Preserving Byzantine-Robust Federated Learning via Blockchain Systems. IEEE Trans. Inf. Forensics Secur. 2022, 17, 2848–2861. [Google Scholar] [CrossRef]
Mohassel, P.; Zhang, Y. Secureml: A system for scalable privacy- preserving machine learning. In Proceedings of the 2017 IEEE Symposium on Security and Privacy (SP), San Jose, CA, USA, 22–26 May 2017; pp. 19–38. [Google Scholar]
Wagh, S.; Tople, S.; Benhamouda, F.; Kushilevitz, E.; Mittal, P.; Rabin, T. Falcon: Honest-majority maliciously secure framework for private deep learning. Proc. Priv. Enhancing Technol. 2020, 1, 188–208. [Google Scholar] [CrossRef]
Ozdayi, M.S.; Kantarcioglu, M.; Gel, Y.R. Defending against back-doors in federated learning with robust learning rate. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), Vancouver, BC, Canada, 2–9 February 2021; pp. 9268–9276. [Google Scholar]
Cao, X.; Fang, M.; Liu, J.; Gong, N.Z. Fltrust: Byzantine-robust federated learning via trust bootstrapping. In Proceedings of the ISOC Network and Distributed System Security Symposium (NDSS), Virtually, 21–25 February 2021; pp. 21–25. [Google Scholar]
Liu, X.; Li, H.; Xu, G.; Chen, Z.; Huang, X.; Lu, R. Privacy-enhanced federated learning against poisoning adversaries. IEEE Trans. Inf. Forensics Secur. 2021, 16, 4574–4588. [Google Scholar] [CrossRef]
He, Y.; Huang, K.; Zhang, G.; Yu, F.R.; Chen, J.; Li, J. Bift: A blockchain-based federated learning system for connected and autonomous vehicles. IEEE Internet Things J. 2022, 9, 12311–12322. [Google Scholar] [CrossRef]
Schoenmakers, B.; Veeningen, M. Universally verifiable multiparty computation from threshold homomorphic crypto-systems. In Proceedings of the International Conference on Applied Cryptography and Network Security, New York, NY, USA, 2–5 June 2015; pp. 3–22. [Google Scholar]
Xu, G.; Li, H.; Liu, S.; Yang, K.; Lin, X. Verifynet: Secure and verifiable federated learning. IEEE Trans. Inf. Forensics Secur. 2019, 15, 911–926. [Google Scholar] [CrossRef]
Kather, J.N.; Krisam, J.; Charoentong, P.; Luedde, T.; Herpel, E.; Weis, C.-A.; Gaiser, T.; Marx, A.; Valous, N.A.; Ferber, D.; et al. Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study. PLoS Med. 2019, 16, 1–22. [Google Scholar] [CrossRef]
Kermany, D.S.; Goldbaum, M.; Cai, W.; Valentim, C.C.; Liang, H.; Baxter, S.L.; McKeown, A.; Yang, G.; Wu, X.; Yan, F.; et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 2018, 172, 1122–1131.e9. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Data sharing in IoMT.

Figure 2. Federated learning (FL) for data sharing in IoMT.

Figure 3. Blockchain-based federated learning (BFL) for data sharing in IoMT.

Figure 4. System model.

Figure 5. Accuracies of our scheme and other schemes when implementing un-targeted attack over PathMINST data, where Att denotes the percentage of attackers.

Figure 6. Accuracies of our scheme and other schemes when implementing label-flipping attack over OCTMNIST data, where Att denotes the percentage of attackers.

Figure 7. Computation comparison between PPBFL and PPML. (a) Device side. (b) Server side.

Figure 8. Communication comparison between PPBFL and PPML. (a) Device side. (b) Server side.

Table 1. Main notation used in this study.

Notation	Description
$D_{i}$	Training datasets of the device $C_{i}$
$w_{i}^{(t)}$	Local model of the device $C_{i}$ in the $t^{th}$ iteration
$η$	Global learning rate
$δ$	Local learning rate
$g_{i}^{(t)}$	Gradient of the device $C_{i}$ in the $t^{th}$ iteration
$G_{s}^{(t)}$	Sum of the gradients in the $t^{th}$ iteration
$c s_{i}^{(t)}$	Cosine similarity between $g_{i}^{(t)}$ and $G_{s}^{(t)}$
$S_{i}^{(t)}$	Score of the device $C_{i}$ in the $t^{th}$ iteration
$b / e / l$	Batch size/Global iterations/Local iterations
${[[g_{i}^{(t)}]]}_{T}^{j}$	$j^{th}$ share of the secret $g_{i}^{(t)}$

Table 2. Descriptions of datasets.

Dataset	Training Set	Class	Test Set	Data Type
PathMNIST	$80, 000$	9	$10, 000$	image
OCTMNIST	$90, 000$	4	$10, 000$	image

Table 3. Training configuration.

Parameter	Value
No. of iterations	1500
No. of epochs	1
Learning rate	$0.5$
Minimal batch size	64

Table 4. Gas cost of each function in TrainFL smart contract.

Function	Gas
construct()	3602173
setEligible()	209906
register()	157970
submit()	17224780
similarity()	20643343
aggregate()	26843542

Table 5. A comparative summary between our scheme and previous schemes.

Schemes	Resisting	Privacy	Poisoning	Light-	Security
	Against SPOF	Protection	Defense	Weight Comp.	Level
SL	√	×	×	−	−
[10]	(Blockchain)
SL+HE	√	√	×	×	Passive
[15]	(Blockchain)	(Paillier)
Learning	√	√	√	√	Passive
Chain [25]	(Blockchain)	(DP)
PPML	√	√	×	−	Passive
[16]	(Single-Server)	(SMC)
Securenn	√	√	×	×	Passive
[29]	(Triple-Server)	(SMC)
Krum	×	×	√	−	−
[17]	(Single-Server)
FLTrust	×	×	√	−	−
[31]	(Single-Server)
BFLC	√	×	√	−	−
[24]	(Blockchain)
Bift	√	×	√	−	−
[33]	(Blockchain)
Our	√	√	√	√	Active
Scheme	(Blockchain)	(SMC)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, X.; Li, H. Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things. Appl. Sci. 2025, 15, 5472. https://doi.org/10.3390/app15105472

AMA Style

Zhu X, Li H. Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things. Applied Sciences. 2025; 15(10):5472. https://doi.org/10.3390/app15105472

Chicago/Turabian Style

Zhu, Xudong, and Hui Li. 2025. "Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things" Applied Sciences 15, no. 10: 5472. https://doi.org/10.3390/app15105472

APA Style

Zhu, X., & Li, H. (2025). Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things. Applied Sciences, 15(10), 5472. https://doi.org/10.3390/app15105472

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Privacy-Preserving Poisoning-Resistant Blockchain-Based Federated Learning for Data Sharing in the Internet of Medical Things

Abstract

1. Introduction

2. Background

2.1. Data Sharing in IoMT

2.2. Federated Learning (FL) for Data Sharing in the IoMT

2.3. Blockchain-Based Federated Learning (BFL) for Data Sharing in IoMT

2.4. Cosine Similarity

2.5. Actively Secure Evaluation Protocol

3. Related Work

3.1. Blockchain-Based Federated Learning

3.2. Privacy-Preserving Federated Learning

3.3. Federated Learning Against Poisoning Attacks

4. Problem Formulation

4.1. Problem Definition

4.2. Design Goals

5. Proposed Approach and System

5.1. System Architecture

5.2. Threat Model

5.3. Initialization

5.4. Local Computation

5.5. Secure Similarity Computation

5.6. Secure Model Aggregation

6. Security Analysis

7. Experimental Setup

7.1. Datasets and Settings

7.2. Poisoning Attacks

7.3. Evaluation Metrics

7.4. FL Settings

8. Experimental Results

8.1. Defense Effectiveness

8.2. Overhead of Our Scheme

9. Discussion

9.1. Functionality

9.2. Performance

10. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI