A Fair Crowd-Sourced Automotive Data Monetization Approach Using Substrate Hybrid Consensus Blockchain
Abstract
:1. Introduction
- Data Provenance: Knowledge of the origin of data to generate insights and void bias.
- Data Privacy: Right to access, rectification, erasure, processing and portability guaranteed inside the European Union by the General Data Protection and Regulations (GDPRs) while handling data.
- Data Protection: Securing the data is essential as it comes at a high cost if availability is not maintained as earlier; even the Toy Story Movie Franchise had the risk of becoming obsolete when a technician accidentally deleted it.
- Data Preparation: It is necessary to clean insight extraction to make it AI-usable as data quality need to be augmented.
- A data monetization architecture based on bidding and guaranteeing data privacy is proposed, incentivizing the data creator being a vehicle, the intermediary being the vehicle OEM and finally the data consumer being the radar OEM.
- Ensuring the credibility and the validity of the data with a review system enabled through blockchain smart contracts.
- A new blockchain platform, Substrate, is chosen for implementing the solution which has the features of embedded Smart Contracts, hybrid consensus of block proposal and finalization algorithms.
- The solution is evaluated for its performance on a cloud platform to measure its throughput, consensus forks, and finality of the transactions.
- The first part introduces data monetization and its importance as well as different commercial decentralized mobility solutions.
- Next, we follow with the state of the art on data monetization and evaluation as well as comparison for each of the works. It is followed by a discussion of different mobility data standards.
- We then continue with our data monetization architecture, blockchain platform, and hybrid consensus. We continue with the data monetization use-case definition, architecture proposal and implementation details on the cloud infrastructure.
- We then end with the functional, performance evaluation with more focus on the hybrid consensus algorithms and a conclusion with future work proposals.
1.1. Data Monetization
- Asset Sale: Sale of Direct Data by Strava, Verizon Wireless.
- Business Process Improvement: Value is extracted from data for optimization of one’s business process by Lufthansa, ThyssenKrupp, and Deutsche Bank.
- Product/Service Innovation: Offering new business services or processes based on data by IBM, Rolls Royce.
- Product/Service Optimization: Optimization of existing service based on data by Ford, Zara and Pirelli.
- Data Insight Sale: Selling derived knowledge by analytics, visualization by Olery, Sendify and DealAngel.
- Contextualization: Addition of supplementary over the existing data for economic benefits by Staples and Walmart.
- Individualization: Customer data are used to customize the product offering and preferences, enhancing the value proposition by eBay, Daimler or Netflix.
1.2. Decentralized Mobility Solutions
- Digital Infrastructure for Moving Objects (DIMO): [3] It is a data-driven decentralized public IoT platform that requires the vehicle owners to install an AutoPi telematics unit in their vehicle, which then submits the data at frequent intervals to the cloud infrastructure. Its infrastructure comprises the Polygon Blockchain, the Inter-Planetary File Storage (IPFS) decentralized storage, the Helium decentralized LoRaWAN wireless network and other additional protocols. Users can submit their data and can gain DIMO tokens as rewards. The platform then utilizes the collected data for receiving recommendations on preventive vehicle maintenance, service history, and other sensor records. The shared data are then sold to aggregators who reward and incentivize the DIMO network and vehicle owners. The compromise we notice with this system is that the data life-cycle is questionable regarding its provenance, destruction, obfuscation by removing sensitive details and the overhead of installing a device for protocol communication. Though the vehicle Users and OEMs monetise their raw data directly they lose the opportunity by failing to enhance the data and build services on top of it which might increase their revenue. Although it is designed to solve the problems of data centralization and privacy concerns, it needs more analysis, as the data are stored by centralized actors in the ecosystem.
- Ocean Protocol: The Ocean Protocol [4] is a data marketplace built on the parity Ethereum Proof of Authority blockchain protocol. It is a decentralized data exchange where individuals or enterprises exchange the data shared as ERC721 data NFT tokens by Ocean smart contracts. Then, ERC20 data tokens are generated for the data service to access the published data for a dynamic or fixed price. The existing use cases generated are for connected living, where the elderly patient’s data are shared for customized insurance and medical assistance. Also, the health data for heart condition patients are shared with Roche Diagnostics from a CoaguChek IN Range device to the ocean protocol, enabling data discovery for other third-party partners offering medical services. Also, a data NFT can be marked as purgatory or unusable if there is an issue with data privacy, quality or copyright infringement. To preserve the confidentiality of the data, the marketplace offers compute-to-data, where instead of the data transfer, the buyer of tokens can obtain the pre-computed trained data model to be used, and the raw data stay in the storage of the data marketplace. This protocol is quite comprehensive with no additional hardware requirements, but there is a problem with the data certification, as data shared can be tracked for their provenance. Still, data quality and genuineness are pointers to be considered by them as anyone can share data and earn tokens without pre-emptive checks.
1.3. State of the Art
1.4. Decentralised Mobility Data Standards
1.5. Mobility Open Blockchain Initiative (MOBI)
1.6. European Telecommunications Standards Institute (ETSI)
1.6.1. International Standard Organisation/TC 307
1.6.2. European Union Blockchain Observatory and Forum
2. Significance of Data Monetization Architecture
- Vehicle OEMs are hit by the wave of Industry 4.0 with the advent of Digital Twin, where all the configuration and data of a vehicle are stored in the cloud. In close accordance is the launch of a Software-Defined Vehicle (SDV) technology. [15] allowing to “centralize data with other components of ADAS, bodywork, chassis, and telematics in a Physical computer unit”. Also, all the customization and upgrades are made over the FOTA (Firmware Over The Air) technology, which is simplified by at most two High-Performance Computers (HPC) for SDV in a vehicle. All these Vehicle-to-Cloud communications, as well as the data that are shared, create the necessity of moving towards a decentralized data monetization where each participant in the ecosystem of OEM like Renault, Component Manufacturers like Qualcomm, Cloud Service providers like Google and the vehicle owner can each own a piece of the pie (service built on top of data and rewards) offered by the data, ensuring mutual benefit.
- Data which the vehicle owner engender have to create a virtuous cycle where each piece of data, starting from the sensor data, can be used by the equipment OEMs like Bosch, Continental. for improvisation, Vehicle OEM can enable the provisioning of data and a vehicle owner can obtain service benefits and up-gradation from the equipment OEM. All these actors create a virtuous cycle of fidelity and continuous improvement complemented by data monetization.
- We design our architecture from a consortium point of view with benefits shared with each actor and guarding data privacy, provenance, certification, accountability and validity with the closed set of participants, which is needed for an automotive ecosystem.
2.1. Next-Generation Distributed Ledger
- Consensus Scalability: As we noticed in our previous works related to Byzantine Fault Tolerance (BFT) blockchains except Clique and QBFT, other protocols’ performance was affected as nodes were increased even from a consortium setting of limited participants. Also, in Clique, the finalization of transactions was a questionable aspect, and from an automotive context, we need finalized transactions as security cannot be compromised.
- Interoperability: Blockchain solutions we designed earlier were not interoperable with other blockchains limiting their communication to receive external information and create new synergies and services, which is needed from the current Web3 view standpoint.
- Embedded Smart Contract: Ethereum smart contract, which we used earlier to create ERC20 and ERC721 tokens, used an EVM (Ethereum Virtual Machine) platform. It uses EVM for the interpretation, execution and finalization of smart contract transactions, which introduces additional processing bottlenecks apart from the transaction consensus and processing. But we envisage building the smart contract and the blockchain node binary to avoid separate execution in an EVM for faster and more secure processing.
- Secured Oracle and OffChain Communication: Ethereum-based smart contracts need a third-party decentralized Ethereum-based Oracle for any external communication or knowledge feed to the smart contract decision logic. These are Chainlink, Band protocol, Pyth Network, and others, which are third-party networks to be depended upon, which defeats the purpose of a decentralized solution. So we need an OffChain worker solution that can connect seamlessly to our internal and external Application Programming Interfaces (APIs) or blockchain networks in the form of Oracles, which we solve in our architecture.
- Mutual and Divisible Monetization: Data that need to be monetized have to be beneficial and sustainable by attributing rewards for everyone in the ecosystem and creating new services and enhanced customer experience, value addition, and product evolution.
- Cycle of Data Certification and Provenance: Data that are shared have to be collated from multiple actors or devices, processed, cleaned, obfuscated for privacy concerns and finally submitted via an API or data pool even for a simple raw data monetization without any computation. These successive stages must be acknowledged from their nascent stage until the end stage by certifying at each step as a Signature or Hash and submitting to the decentralized protocol, ensuring a data certification and provenance cycle.
- Privacy By Design:1 Our architecture is designed to follow Article 25 of GDPR principles of “Privacy By Design” rather than “Privacy By Default” [19,20]. In our earlier work on data certification, we followed Privacy by Default through pseudonymization. In this work, we ensure that during data monetization, we follow Privacy by Design, where the data can be read, accessed and verified only by the necessary participants without being public to anyone in the ecosystem.
2.2. Blockchain Framework by Design: Substrate
2.3. Hybrid Consensus
- Block Authoring: It processes the transactions or extrinsic values and packages them into blocks by the set of validators for each discrete time slot. This is usually performed by a chosen validator in a round-robin selection by Authority Round (AuRa) consensus or Blind Assignment for a Blockchain Extension (BABE) scheme. These blocks are then subjected to the addition as agreed chain storage where a block header contains a reference to its parent or a previous block.
- Block Finalization: A previously authored blockchain contains a chain of blocks and can be subjected to forks where two blocks refer to a single parent block. So to resolve the fork or finalize the chain, we need an additional algorithm to select the best chain, which is the longest chain with higher weightage as in Greediest Heaviest Observed Sub-Tree-based Recursive Ancestor-Deriving Prefix Agreement (GRANDPA). This ensures a deterministic finality where a chain can never be compromised, as only block authoring offers a probabilistic finality.
Cross-Consensus Messaging (XCMP)
2.4. Authority Round (AuRa)
2.5. Blind Assignment for Blockchain Extension (BABE)
2.6. Greediest Heaviest Observed Sub-Tree-Based Recursive Ancestor-Deriving Prefix Agreement (GRANDPA)
3. Use-Case Definition
3.1. RADAR Significance
3.2. Road RADAR Signature
- Localization layer: Vehicle position is determined based on its driving lane and merged with RADAR road signature, including the video localization map. Also, the object information in the vicinity is compared with the information from the other sensors to determine relative position.
- Planning Layer: This layer provides driving planning information comprising the road course, traffic sign and speed limit, including bends and gradients.
- Dynamic Layer: Traffic information, including deadlocks, construction work, maintenance or parking space availability, is provided.
3.3. Virtuous Cycle of Road RADAR Signature Data
- Asset Component: This component creates an asset for vehicle OEM as a non-fungible ERC 721 token in the network. RADAR data of the vehicles for a defined condition of localization, time period, error tolerance and other miscellaneous metadata are represented as an asset along with its floor and ceiling price.
- Asset Service Component: The component is similar to the asset component but consecrated towards the road RADAR signature map, which is offered by the RADAR OEM to the vehicle users for enhanced safety, autonomicity, and optimized localization information which is accompanied by a ceiling and floor price.
- Bidding Component: Each asset or asset service created as an ERC 721 token has its price determined based on a transparent bidding process between RADAR OEM, Vehicle OEM and Vehicle Users consenting to the transaction.
- OffChain Component: As soon as the asset or the asset service is sold, data must be offloaded from each vehicle to RADAR OEM via Vehicle OEM. Vehicle OEM acts as a guarantor or mediator in the blockchain for initial data collation, data cleaning and obfuscating of any sensitive or private data. These data processing steps are performed offchain in a cloud middleware but closely looped with the blockchain to certify the process and the data.
- Asset Data Collation and Certification: The data offloaded to the RADAR OEM are then processed, analyzed and interpolated with map data to be offered as a service later completing the asset finalization along with certification along its entire process.
- Asset Service Offering: Road RADAR map signature is offered to the consenting vehicle back in the blockchain ecosystem with an interest of subscriptions collated in the ledger by Vehicle OEM and transaction completion post the fund transfer.
- Incentivization for Everyone: For each transaction of asset or asset service involving RADAR data monetization and Road RADAR Signature Service, each actor in the ecosystem is attributed an incentivization. The asset monetary transaction distributes the commission to Vehicle OEM, Vehicle Users, and RADAR OEM. The necessary monetary benefit along with data certification, anonymity and quality are maintained. Moreover, the RADAR data or RADAR signature service represented as tokens ascend in price appreciation based on usage review submitted to the smart contract.
- Cyclical Economy Monetization: In the proposed architecture, the vehicle owners can monetize their data generated in the form of credits which can be later reclaimed or used in subscribing to any services like insurance and maintenance offered by Vehicle OEM. Also, the advantage for Vehicle OEMs is creation of repeated sales of products and services benefitting the customer as well as the organization subsidizing the cost of building a vehicle connectivity infrastructure. RADAR OEM can improvise their services and products leveraging from the vehicle owners data. The data can be used to train and optimize the services offering an intelligent and custom-tailored solution to each vehicle in the form of traffic advice, road conditions, driving behavior, etc.
4. Architecture Solution
- DataMarket: Decentralized Smart Contract Pallet realized on a Substrate blockchain framework, which orchestrates the entire data monetization transaction subject to hybrid consensus algorithms and its validation.
- Vehicle_OEM_X|Y: Vehicle Original Equipment Manufacturers like Renault, Stellantis or Volkswagen, which provide the OffChain cloud infrastructure and participate as validators in the blockchain consensus. Each participant installs and maintains the blockchain client for data transfer, transaction execution, and liaison of the vehicles.
- Vehicle_M|N: Vehicle Users or Autonomous Fleet devices have a wallet-based account in the blockchain process and provide data necessary for monetization and subscribe to the service offered for enhancing driving.
- RADAR_OEM_A|B: RADAR Original Equipment manufacturers like Continental, Bosch and NXP are interested in the generated RADAR data from the vehicles for enhancing their product offering as well as creating additional business services for customers like road RADAR signature map for their end users.
4.1. Tokenized Non-Fungible Asset Data Set Component
- Data Demand Phase: In the initial phase, when the RADAR_OEM needs the RADAR data, it publishes the demand as an event notification transaction onto the network with the location, vehicle speed and acceptable price limit requirements. Vehicle_OEM can then respond to the event by broadcasting the demand to its vehicle clients for its participation contentment.
- Data Asset Offering and Bid Phase: Vehicle_OEM_X|Y responds individually by creating an ERC 721 token as an asset with asset criteria, floor, and roof price. Then, to maintain bid privacy, RADAR_OEM submits an encrypted bid with the public key of Vehicle_OEM_X|Y, respectively. Then, among the bids, one of RADAR_OEM_As is accepted by Vehicle_OEM_X as the bid of RADAR_OEM_B is not an acceptable price. The fund of RADAR_OEM_A for the bid is transferred to the escrow account as an intermediate transfer which is transferred to Vehicle_OEM_X when the bid criteria and the asset are respected along with necessary certification proof submissions.
- Data Aggregation and Certification: Vehicle_OEM_X starts the data collection job from its set of vehicle clients with the necessary criteria. Vehicles start recording RADAR data and submitting it with its signature hash as proof to the data pool and smart contract. Then, Vehicle_OEM_X processes the data removing the user’s sensitive identification data, submits the final hash proof to the ledger, and generates a data access token to a middleware service to be retrieved later.
- OffChain Data Transfer and OAuth Access: The blockchain OffChain worker component then retrieves the OAuth access token from the middleware and submits it as an API POST request to RADAR_OEM_A if Vehicle_OEM_X submits all the certification proofs to the blockchain.
- Fund Transfer including Finalization of Asset Transfer: Then, the bid amount is transferred by the smart contract from escrow to the Vehicle_OEM_X account and Vehicle_Accounts that participate in this monetization process such that the above condition of the access token and proof are respected.
- Review of transferred Asset: The RADAR_OEM_A that retrieves the processed data from the data pool utilizing the OAuth access token reviews the data. The result of the review process is submitted to the blockchain against the asset ERC721 token issued, which is a fidelity action to augment the price of the data issued by Vehicle_OEM_X in a later transaction or diminish if the review score is bad.
4.2. Tokenized Non-Fungible Asset Data Service Component
- Road RADAR Signature Asset Service Interest Collation: A particular RADAR_OEM_A has to build the localization, additional and planning layers of additional information over the processed data as the road RADAR signature map service publishes an event to the blockchain smart contract of its availability. This is then processed by Vehicle_OEM_X|Y, where each one gathers the interested vehicle clients for subscribing to this service.
- Road RADAR Signature Asset Service Creation and Bidding: RADAR_OEM_A creates an asset service NFT ERC 721 token and the characteristics of the service with the acceptable price range limit.
- Asset Service Subscription and Proof Generation: Then, each of Vehicle_OEM_X or Y expresses the interest in the subscription service on behalf of its clients as an encrypted bid. RADAR_OEM_A accepts the bid of Vehicle_OEM_X along with its vehicles, transferring funds from the vehicles consented to the escrow account.
- OffChain transfer of Asset Service OAuth API Access Token: RADAR_OEM_A generates an OAuth access token for its road RADAR signature map service along with the hash proof of the service including raw data location, vehicle speed and quality. The OffChain worker retrieves the encrypted OAuth access token from the ledger and submits it to Vehicle_OEM_X.
- Fund Transfer and Finalization of Asset Service Transfer: In parallel, the OffChain worker transfers the funds from the escrow for the bid to RADAR_OEM_A. Also, the asset service token ownership is transferred from RADAR_OEM_A to Vehicle_OEM_X and the interested vehicles.
- Review of Transferred Asset Service: The vehicles or Vehicle_OEM concerned for the asset service token can review the improvement in vehicle driving as a result of road RADAR signature map service, positively or negatively connoting its price and fidelity accordingly.
5. Implementation
5.1. Substrate Smart Contract Pallet
5.2. Middleware Components
- AuthController: It is the primary class interface in the middleware which accepts the request and redirects it to the necessary business service layer for storing assets, asset service data, OAuth token generation, as well as encryption and decryption process for the bidding procedure.
- UserController: This authentication service retrieves RADAR and RADAR service subscription data.
- Storage: This is the persistence layer for all the details regarding asset, asset service, authentication, and OAuth access tokens.
- Asset/Asset Service Feedback: The asset or asset service feedback in the form of review is also stored OffChain as additional storage.
- RADAR Data: This contains the data payload representation generated by the vehicle and transacted via the blockchain.
5.3. Infrastructure
6. Evaluation
6.1. Functional Evaluation
- Set of Vehicle_OEM is represented as Vo = {Vo1, Vo2 …VoN} for N participants.
- Set of RADAR_OEM is represented as set of Ro = {Ro1, Ro2 …RoK} for K participants.
- Set of Vehicles are represented as V = {V1, V2 …VM} for M participants.
- Processed RADAR data are monetized by each vehicle belonging to a Vo represented as {D1, D2 …DL} for L successive data assets to be exchanged over the data market.
- Processed road RADAR signature map (RRS), which a RADAR OEM Ro monetizes, represented as {Rs1, Rs2 …RsP} for P successive RRS asset service to be exchanged as a virtuous cycle in the data market.
- Bid either for asset data or asset data service is represented as {Bx,y,1, Bx,y,2 …Bx,y, S}, where x signifies the bidder and y signifies the asset or asset service tendered.
- Certification for any asset data DL, CD is defined as the set of hash signatures derived at each stage of processing steps of individual vehicle data generation, collated raw data in the pool before processing, collated raw data after processing in Vo cloud and then including the OAuth access token.
- Certification for any asset service RSS RsP, CRs is defined as the set of hash signatures derived at each stage of signature formation like initial data state, augmented data after colla ion and improvised RADAR data with signature along with subscription access token.
- Review for any asset or asset service is defined as a score of 0…1 where zero represents the lowest and one represents the highest score; any median value is also possible. This is represented as Ri,j, where i represents the reviewer and j represents the asset or the asset service transferred and reviewed. Based on the review accumulated for asset data DL or asset service RSS RsP, its future issued asset or asset service can have higher or lower pricing based on its reputation score mean RSMRADAR_OEM | Vehicle_OEM. The reputation score for the asset is proportional to the number of certifications received for the asset or the asset service.
- Reputation Score Mean RSMRADAR_OEM | Vehicle_OEM for either Vehicle_ or RADAR OEM is the arithmetic mean of historical reviews accumulated. For example, the Reputation score mean for RADAR_OEM or Vehicle_OEM is represented asRMj = mean(Ri−2,j, Ri−1,j, ... Ri,j)RSMRadar_OEM | Vehicle_OEM = mean(RMj−2, RMj−1,... RMj)
- Global Fairness: Fairness in the case of any application is defined by the work of FairSwap [9]: “A fair exchange protocol allows a seller S to sell a digital commodity D for a fixed price p to a buyer B. The protocol is secure if B only pays if he receives the correct D”. We extend this work in our above design as in [8] for Global Fairness where every participant in our ecosystem has the following guarantees:
- –
- Each participant is ensured that fund transfer for any D happens only when the certification conditions is satisfied for CD.
- –
- Fund transfer is not fulfilled immediately as it is placed in an escrow account and then, on verification, transferred to the destined participant account.
- –
- In either case of the virtuous cycle from RADAR data conversion to road RADAR signature, the facilitators that provision the cloud infrastructure or other value adders can benefit from the commission or fidelity rewards.
- –
- The asset or asset service token exchange results in the influence of a transparent reputation accumulated by either honest or dishonest activity. This is publicly verifiable and can result in the augmenting or decrease in a reputation participant.
- –
- To ensure fair pricing, the sealed bid mechanism is encrypted with the bid receiver’s public key, which results in a competitive remuneration for either the asset or asset service. Also, an asset or asset service issuer’s reputation organically influences the probability of data or data signature pricing.
- Full Interoperability: Our solution achieves two levels of interoperability, intra-chain and inter-chain, as follows:
- –
- Intra-Chain: The above monetization solution can integrate with external agnostic (centralized or decentralized) systems through OffChain workers for API requests or responses. Also, as we utilize a balanced mix of events, signed and unsigned transactions (extrinsic) to differentiate the priority of message passing between each actor in the system, unnecessary overhead in the distributed system, especially of consensus, is avoided.
- –
- Inter-Chain: As it is based on the Substrate framework, this implementation can be extended as a para-chain to integrate with other blockchains or para-chains through Polkadot.
- Chained Data Certification: As mentioned earlier, either for asset RADAR data or asset service road RADAR signature, we ensure the maintenance of the history of the data provenance along with its hash signature-based certification to avoid any counterfeit and validate the health of the data as well as the service.
- Privacy By Design: The solution granularizes the privacy at each level by the following mechanisms:
- –
- Account Pseudonymization: All the accounts are pseudonymized concerning vehicle participants as it is necessary to identify RADAR_OEMs and Vehicle_OEMs. A vehicle’s original identity is another intermediate identity that can be dissociated in case of necessity. This is to respect the forgetting right of GDPR [28] if the vehicle owner decides to remove his information.
- –
- Bid Sealing: All the bids for either asset or asset service are sealed using encryption, which can be decrypted only by the bid receiver, ensuring competitiveness, transparency, and privacy.
- –
- Privacy Data Concealment: RADAR data from the origin in the vehicle until the data reception by the RADAR_OEM is privacy treated with the making of sensitive data, and the hash signature generated is added with controlled noise like location, time, etc., to ensure authenticity.
- Dynamic Pricing: As discussed earlier, pricing is based on a sealed bid oriented proportionally to the participant’s reputation and review of the exchanged asset as well as asset service data tokens to avoid any price manipulation. This ensures a reputation-based adjustment of the asset pricing, creating a virtuous cycle in fidelity and incentivization.
- Accountable Reputation: Reputation, based on the arithmetic mean of reviewing individual assets transferred as extrinsic transactions diffused through the distributed ledger, is verifiable and transparent for all the ecosystem participants.
6.2. Security
- Injection: The blockchain systems can be compromised by using malicious data, which can be in the form of Integer Overflow or Batch Overflow [30]. We ensure adequate checks and balances in the system where external parameterized signed extrinsic are minimized except for encrypted bids.
- Access Control: Each participant of Vehicle_OEM, RADAR_OEM, Vehicle, and escrow is based on access control ensured through permission pallets and privileged calls as they are consortium-based, limiting risk exposure.
- OffChain Manipulation: Our solution communicates with the OffChain only for retrieval and data job scheduling, not as an oracle for knowledge-based decisions that shield against manipulation attacks.
- Sensitive Data Exposure: Data on the blockchain are only related to the original blockchain accounts and their pseudonymized transactions. All the data are managed OffChain, which ensures that sensitive data are hidden and only managed by the blockchain trust mechanism.
- Smart Contract Security: In this application, our decision of smart contract deployment is not external and is compiled along with the node and executed along with its runtime. Further, the ERC 721 token specification of Ethereum is adapted for the Substrate FRAME library, and each logic is separated for token creation, OffChain data orchestration, as well as certification validation, which is tested for performance.
6.3. Experimental Evaluation
- Performance evaluation of the data monetization solution by submitting extrinsic transactions and measuring the throughput of finalized and valid transactions.
- Understanding the hybrid consensus algorithm of AuRa with GRANDPA or BABE with GRANDPA regarding its parameters and deriving an optimum setting, for example, Block Period.
- Analyzing the scalability and fork occurrence in the consensus protocol and evaluating its suitability for enterprise mobility solutions.
6.4. Performance Study
- A client that submits the transaction to the load balancer offloads the transaction to a node chosen based on the round-robin algorithm. The client is based on Javascript, which uses the library of Polkadot.js for the following purposes:
- Establishing WebSocket communication to the Substrate node and retrieving the meta details of the network like pallet information, account details like a nonce, public key identifier and other miscellaneous details like chain information, fork information, block details and transaction details.
- Transaction (extrinsic) construction with the signature using the private key of an account, encoding and decoding transaction payload, submission of the transaction and retrieving its status in the network as finalized or processing.
- For the test, the client submits the transaction of CreateAsset, which creates an asset NFT ERC 721 token in the network. Each transaction has a unique processing complexity based on its purpose logic, affecting its finalization rate or throughput in the network. The test transaction comprises the following complexity:
- Calculation of Hash for generating a unique asset Id. Counted as one operation.
- Storage operation of asset details in asset storage, storing asset count, asset index, and ownership details in separate runtime storage structure either as a storage map, storage double map, or storage value. They are counted as 10 operations.
- Read operation from storage to retrieve the existing details before any update. They are counted as five operations.
- Transactions (extrinsic) are submitted by the multithreaded client of 1000 threads signed by 2000 pre-generated accounts.
- Then, the transaction is submitted asynchronously at an input rate of 1000 transactions per second with 50,000 total transactions repeated in three iterations to avoid any bias in the result. The block size comprising extrinsic transactions is limited to five megabytes which is optimal as small size can induce message overload and bigger size can delay by a processing bottleneck.
- The submitted signed transactions are checked for finalization, and based on block number, the time difference is calculated for estimating the finalized throughput of the network.
6.4.1. Hybrid Consensus Parameter Analysis: Block Period
- Higher interval of block period directly increases the time associated with block production
- Lower or minimal block period decreases the turn-around time for block creation time. Still, it results in a race condition between transaction verification and consensus operation which overflows consensus slot time.
6.4.2. Hybrid Consensus Scalability Analysis
7. Conclusions
- Auction: Auctions can be made more transparent by a commit reveal scheme where there are no time-bound constraints and hidden bids are eventually public when all the bids are received and committed.
- Privacy: Differential privacy [33] can be applied for more granular privacy control on the data ensuring more concealment than the masking techniques. Also, account abstraction based on the ERC 4337 [34] protocol can be considered for anonymizing network participants as an alternative to pseudonymization in our protocol.
- Smart Contract: Smart contract embedded along with client runtime in our Substrate blockchain has better security than by deployment. But it has not been audited using tools like MythX [29], or legal aspects are offloaded, which can be considered in the future.
- Interoperability: Substrate has both intra-chain and inter-chain communication extensibility. The inter-chain exchange in the form of Parachains has several limitations discussed in [35]. They are due to being monetary-based Proof of Stake Polkadot network for parachain slots, validator election transparency issues and governance problems due to the “prime voter” as well as 13 member council governance affecting decentralization. Also, there are liquidity issues with the Polkadot network due to economic reasons, and many parachains like Lido have ceased their operation.
- Certification Proof: Hash-based chained certification can be replaced with Merkle proofs or Zero Knowledge proofs where any user can verify the proof without actual data revealed.
- Consensus: Our implementation analyzes the hybrid consensus of AuRa, BABE, and GRANDPA available in Substrate necessary for our data monetization use case. Another consensus of Nominated Proof of Stake can be tested, available in Substrate Polkadot based on monetary conditions of currency called DOT’S. It can be applied to our use case but needs monetary-based staking and slashing constraints [35].
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Baecker, J.; Engert, M.; Pfaff, M.; Krcmar, H. Business Strategies for Data Monetization: Deriving Insights from Practice. In Proceedings of the 15th International Conference on Wirtschaftsinformatik, Potsdam, Germany, 8–11 March 2020. [Google Scholar]
- Ofulue, J.; Benyoucef, M. Data monetization: Insights from a technology-enabled literature review and research agenda. Manag. Rev. Q. 2022, 74, 521–565. [Google Scholar] [CrossRef]
- Network, D. Digital Infrastructure for Moving Objects (DIMO). Available online: https://docs.dimo.zone/docs (accessed on 7 December 2023).
- Ocean Protocol Foundation. Ocean Protocol: Tools for the Web3 Data Economy. In Handbook on Blockchain; Springer: Cham, Switzerland, 2022. [Google Scholar]
- Avyukt, A.; Ramachandran, G.; Krishnamachari, B. A Decentralized Review System for Data Marketplaces. In Proceedings of the 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), Sydney, Australia, 3–6 May 2021; pp. 1–9. [Google Scholar] [CrossRef]
- Meijers, J.; Putra, G.D.; Kotsialou, G.; Kanhere, S.S.; Veneris, A. Cost-Effective Blockchain-based IoT Data Marketplaces with a Credit Invariant. In Proceedings of the IEEE International Conference on Blockchain and Cryptocurrency (ICBC), Sydney, Australia, 3–6 May 2021; pp. 1–9. [Google Scholar] [CrossRef]
- Al-Sada, B.; Lasla, N.; Abdallah, M. Secure Scalable Blockchain for Sealed-Bid Auction in Energy Trading. In Proceedings of the 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), Sydney, Australia, 3–6 May 2021; pp. 1–3. [Google Scholar] [CrossRef]
- Banerjee, P.; Ruj, S. Blockchain Enabled Data Marketplace—Design and Challenges. arXiv 2018, arXiv:1811.11462. [Google Scholar] [CrossRef]
- Dziembowski, S.; Eckey, L.; Faust, S. FairSwap: How to Fairly Exchange Digital Goods. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, Toronto, ON, Canada, 15–19 October 2018; pp. 967–984. [Google Scholar] [CrossRef]
- FrØland, H.I.; Palm, E.; Kralevska, K.; Gligoroski, D. Web3 and Blockchain for Modernizing the Reseller Market. In Proceedings of the 2023 Fifth International Conference on Blockchain Computing and Applications (BCCA), Kuwait City, Kuwait, 24–26 October 2023; pp. 215–220. [Google Scholar] [CrossRef]
- MHP-Riddle & CODE. The Automotive Sector and Blockchain. Available online: https://dlt.mobi/blockchain-and-the-automotive-sector-by-sebastian-becker-and-katarina-preikschat/ (accessed on 9 December 2023).
- European Telecommunications Standards Institute. Permissioned Distributed Ledger (PDL); Supporting Distributed Data Management. Available online: https://www.etsi.org/technologies/permissioned-distributed-ledgers (accessed on 22 December 2023).
- International Standards Organization. ISO/TC 307 Blockchain and Distributed Ledger Technologies. Available online: https://www.iso.org/-committee/6266604.html (accessed on 7 November 2023).
- European Union Blockchain Observatory & Forum. Blockchain Applications in the Automotive Sector. Available online: https://www.eublock-chainforum.eu/news/blockchain-applications-automotive-sector (accessed on 21 November 2023).
- Continental Automotive. Software-Defined Vehicle. Available online: https://www.continental-automotive.com/en-gl/Passenger-Cars/Technology-Trends/software-defined-vehicles (accessed on 13 January 2024).
- Samuel, C.N.; Glock, S.; Verdier, F.; Guitton-Ouhamou, P. Choice of Ethereum Clients for Private Blockchain: Assessment from Proof of Authority Perspective. In Proceedings of the 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), Sydney, Australia, 3–6 May 2021; pp. 1–5. [Google Scholar] [CrossRef]
- Samue, C.N.; Severine, G.; David, B.; Verdier, F.; Patricia, G.O. Automotive Data Certification Problem: A View on Effective Blockchain Architectural Solutions. In Proceedings of the 2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Virtual, 4–7 November 2020; pp. 0167–0173. [Google Scholar] [CrossRef]
- Gerrits, L.; Samuel, C.N.; Kromes, R.; Verdier, F.; Glock, S.; Guitton-Ouhamou, P. Experimental Scalability Study of Consortium Blockchains with BFT Consensus for IoT Automotive Use Case. In Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, Coimbra, Portugal, 15–17 November 2021; pp. 492–498. [Google Scholar] [CrossRef]
- Blair, Tesler (Insight). What Is Privacy by Design and by Default? Available online: https://www.morganlewis.com/pubs/2019/03/the-edata-guide-to-gdpr-what-is-privacy-by-design-and-by-default (accessed on 12 January 2024).
- What Does Data Protection ‘By Design’ and ‘By Default’ Mean? Available online: https://commission.europa.eu/law/law-topic/data-protection/reform/rules-business-and-organisations/obligations/what-does-data-protection-design-and-default-mean%5Fen (accessed on 17 October 2023).
- Hameed, K.; Barika, M.; Garg, S.; Amin, M.B.; Kang, B. A taxonomy study on securing Blockchain-based Industrial applications: An overview, application perspectives, requirements, attacks, countermeasures, and open issues. J. Ind. Inf. Integr. 2022, 26, 100312. [Google Scholar] [CrossRef]
- Parity Technologies. Substrate Technology. Available online: https://substrate.io/technology/ (accessed on 8 January 2024).
- Parity Technologies. Authority Round. Available online: https://paritytech.github.io/substrate/master/sc%consensus%5Faura/index.html (accessed on 20 December 2023).
- Petrowski, J. Blind Assignment for Blockchain Extension (BABE). Available online: https://polkadot.network/blog/polkadot-consensus-part-3-babe (accessed on 17 December 2023).
- David, B.; Gaži, P.; Kiayias, A.; Russell, A. Ouroboros Praos: An Adaptively-Secure, Semi-synchronous Proof-of-Stake Blockchain. In Advances in Cryptology—EUROCRYPT 2018; Springer: Berlin/Heidelberg, Germany, 2018; pp. 66–98. [Google Scholar]
- Stewart, A. Grandpa Finality. Available online: https://research.web3.foundation/en/latest/polkadot/finality.html (accessed on 18 January 2024).
- Bosch. Road Signature. Available online: https://www.bosch-mobility.com/en/solutions/automated-driving/road-signature (accessed on 22 November 2023).
- 2018 Reform of EU Data Protection Rules. Available online: http://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32016R0679 (accessed on 10 December 2023).
- Rateb, J. Blockchain Pour L’internet des véHicules: Une Solution IoT Decentralisee Pour la Communication et le Paiement des Vehicules en Utilisant Ethereum. Doctoral Dissertation, HESAM Université, Paris, France, 2021. [Google Scholar]
- Jabbar, R.; Kharbeche, M.; Al-Khalifa, K.; Krichen, M.; Barkaoui, K. Blockchain for the Internet of Vehicles: A Decentralized IoT Solution for Vehicles Communication Using Ethereum. Sensors 2020, 20, 3928. [Google Scholar] [CrossRef] [PubMed]
- Stack Exchange. BABE GRANDPA Stalled. Available online: https://substrate.stackexchange.com/questions/214/recovering-from-stalled-finality-babe-grandpa (accessed on 11 December 2023).
- Wang, Y. The Adversary Capabilities in Practical Byzantine Fault Tolerance. In Proceedings of the Security and Trust Management: 17th International Workshop, STM 2021, Darmstadt, Germany, 8 October 2021; pp. 20–39. [Google Scholar] [CrossRef]
- Desfontaines, D.; Pejo, B. SoK: Differential privacies. Proc. Priv. Enhancing Technol. 2019, 2020, 288–313. [Google Scholar] [CrossRef]
- Buterin, V.; Weiss, Y.; Gazso, K.; Patel, N.; Tirosh, D.; Nacson, S.; Hess, T. ERC-4337: Account Abstraction Using Alt Mempool. Available online: https://eips.ethereum.org/EIPS/eip-4337 (accessed on 9 January 2024).
- Abbas, H.; Caprolu, M.; Di Pietro, R. Analysis of Polkadot: Architecture, Internals, and Contradictions. In Proceedings of the 2022 IEEE International Conference on Blockchain (Blockchain), Espoo, Finland, 22–25 August 2022; pp. 61–70. [Google Scholar] [CrossRef]
- Samuel, C.N. Connected Car Communication by DLT Technologies: Mobility Service Implementation by Adaptation of Consortium Blockchain Consensus Algorithms. Doctoral Dissertation, Université Côte d’Azur, Nice, France, 2023. [Google Scholar]
- Samuel, C.N.; Verdier, F.; Glock, S.; Guitton-Ouhamou, P. CUBA: An Evolutionary Consortium Oriented Distributed Ledger Byzantine Consensus Algorithm. In Proceedings of the Distributed Computing and Artificial Intelligence, 20th International Conference, Guimaraes, Portugal, 12–14 July 2023; Ossowski, S., Sitek, P., Analide, C., Marreiros, G., Chamoso, P., Rodríguez, S., Eds.; Springer: Cham, Switzerland, 2023; pp. 31–43. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Samuel, C.N.; Verdier, F.; Glock, S.; Guitton-Ouhamou, P. A Fair Crowd-Sourced Automotive Data Monetization Approach Using Substrate Hybrid Consensus Blockchain. Future Internet 2024, 16, 156. https://doi.org/10.3390/fi16050156
Samuel CN, Verdier F, Glock S, Guitton-Ouhamou P. A Fair Crowd-Sourced Automotive Data Monetization Approach Using Substrate Hybrid Consensus Blockchain. Future Internet. 2024; 16(5):156. https://doi.org/10.3390/fi16050156
Chicago/Turabian StyleSamuel, Cyril Naves, François Verdier, Severine Glock, and Patricia Guitton-Ouhamou. 2024. "A Fair Crowd-Sourced Automotive Data Monetization Approach Using Substrate Hybrid Consensus Blockchain" Future Internet 16, no. 5: 156. https://doi.org/10.3390/fi16050156
APA StyleSamuel, C. N., Verdier, F., Glock, S., & Guitton-Ouhamou, P. (2024). A Fair Crowd-Sourced Automotive Data Monetization Approach Using Substrate Hybrid Consensus Blockchain. Future Internet, 16(5), 156. https://doi.org/10.3390/fi16050156