Design of Chained Document HTML Generation Technique Based on Blockchain for Trusted Document Communication
Abstract
:1. Introduction
2. Related Research
2.1. PDF and HTML
2.2. Document HTML
- (a)
- The encoding of HTML documents should be UTF-8.
- (b)
- All related resources such as an image for an HTML document must be embedded and do not compose the document by linking external resources. HTML documents have to be single files like PDF and do not allow linking to external resources.
- (c)
- Do not use <audio>, <video> multimedia tags. Audio and video tags are difficult to be included in a document due to the file size problem, and these tags are not essential as a document. Therefore, the <audio> and <video> multimedia tags, which are multimedia tags provided by HTML5, are not used.
- (d)
- Do not use external resources container tags which are <object>, <iframe>, <embed>, <param>. The external resource container tags are hard to be embedded in a single document, and the function of the tags can be shown depending on the device. Moreover, these tags are not an essential element as a document. Therefore, <object>, <iframe>, <embed>, <param> tags, which are external resources container tags, are not used.
- (e)
- Do not allow asynchronous data loading. After the HTML document is opened, it is not allowed to load data by an asynchronous call method using a scripting language such as JavaScript jQuery library. Data itself is not included with asynchronous data loading in the HTML document, and HTML document content can be different based on data loading. The asynchronous data loading can be a vulnerability to the content integrity of the HTML document.
- (f)
- The content of the HTML document must be digital signed. The digital signed hash value and the digital signature public key must also be included so that they can be verified.
2.3. Blockchain
3. Chained Document HTML Generation Technique
3.1. Chained Document HTML Conformance
- All related resources must be in an HTML document internally;
- An HTML document can be opened in a standard web browser without any additional software.
3.2. Design of the Chained Document HTML Structure
3.3. Design of the Chained Document Blockchain Structure
3.4. Chained Document HTML Generation and Verification Process
4. Experimental Verification
4.1. Experiment Environment
4.2. Sample HTML Document for Experiment
4.3. Generate Chained Document HTML
4.4. Verify Chained Document HTML
5. Comparison
5.1. PDF vs. HTML vs. Document HTML vs. Chained Document HTML
5.2. File Size Overhead
6. Conclusions
Author Contributions
Funding
Informed Consent Statement
Acknowledgments
Conflicts of Interest
References
- Sprague, R.H., Jr. Electronic document management: Challenges and opportunities for information systems managers. MIS Q. 1995, 19, 29–49. [Google Scholar] [CrossRef]
- Kim, H.C. Issues and Subjects of the Framework Act on Electronic Document and Electronic Commerce. Law Stud. 2012, 15, 293–322. [Google Scholar]
- DocuSign. Available online: https://www.docusign.com/ (accessed on 20 November 2021).
- E-Document Integrated Support Center. Available online: https://www.npost.kr/ (accessed on 20 November 2021).
- Hwang, H.C.; Kim, W.J. Design of Document-HTML Generation Technique for Authorized Electronic Document Communication. J. Soc. Korea Ind. Syst. Eng. 2021, 44, 51–59. [Google Scholar] [CrossRef]
- Warnock, J.E.; Geschke, C. Founding and Growing Adobe Systems, Inc. IEEE Ann. Hist. Comput. 2019, 41, 24–34. [Google Scholar] [CrossRef]
- Staar, P.W.; Dolfi, M.; Auer, C.; Bekas, C. Corpus conversion service: A machine learning platform to ingest documents at scale. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 774–782. [Google Scholar]
- Rohlmann, S.; Mladenov, V.; Mainka, C.; Schwenk, J. Breaking the Specification: PDF Certification. In Proceedings of the 2021 IEEE Symposium on Security and Privacy (SP), San Francisco, CA, USA, 24–27 May 2021; pp. 1485–1501. [Google Scholar]
- W3C. HTML 5.2. Available online: https://www.w3.org/TR/html52/ (accessed on 20 November 2021).
- Kim, S.K.; Huh, J.H. Artificial Neural Network Blockchain Techniques for Healthcare System: Focusing on the Personal Health Records. Electronics 2020, 9, 763. [Google Scholar] [CrossRef]
- Radanliev, P.; De Roure, D.; Burnap, P.; Santos, O. Epistemological equation for analysing uncontrollable states in complex systems: Quantifying cyber risks from the internet of things. Rev. Socionetw. Strateg. 2021, 15, 381–411. [Google Scholar] [CrossRef]
- Abdi, A.I.; Eassa, F.E.; Jambi, K.; Almarhabi, K.; Khemakhem, M.; Basuhail, A.; Yamin, M. Hierarchical Blockchain-Based Multi-Chaincode Access Control for Securing IoT Systems. Electronics 2022, 11, 711. [Google Scholar] [CrossRef]
- Masinter, L. The “data” URL scheme. In RFC 2397; The Internet Society: Reston, VA, USA, 1998. [Google Scholar]
- LetsEncrypt. Certificate Authority Providing TLS Certificates. Available online: https://letsencrypt.org/ (accessed on 20 November 2021).
- Dasso, A.; Funes, A.; Riesco, D.; Montejano, G. Computing Power, Key Length and Cryptanalysis. An Unending Battle? arXiv 2020, arXiv:2011.00985. [Google Scholar]
- Etherscan. Available online: https://etherscan.io (accessed on 20 November 2021).
Public Blockchain | Private Blockchain |
---|---|
Open network | Closed network |
Anyone can join the network | The user who is invited can join the network |
Anonymous | Identified |
Bitcoin, Etherium | HyperLegder |
Anonymous | Nonymous |
Bitcoin, Ethereum | Hyperledger |
Key | Description |
---|---|
TIMESTAMP | Timestamp for the document creation |
CREATOR | Document creator |
DOCUMENT-HTML-CONTENT-DIGEST | Hash digest value of the HTML content |
CERTIFICATE | Public key for the document-HTML-content-signed-digest |
OS | Ubuntu 20.04.2 LTS 64bit |
Geth | 1.10.4-stable |
OpenSSL | OpenSSL 1.1.1f |
Certificate | Let’s Encrypt SSL Certificate 2048bit key |
Python | Python 3.8.8 |
Web3 |
Tag | Count | Tag | Count | Tag | Count | Tag | Count |
---|---|---|---|---|---|---|---|
td | 15 | li | 3 | form | 1 | h2 | 1 |
th | 9 | hr | 3 | table | 1 | thead | 1 |
div | 9 | h4 | 2 | head | 1 | tbody | 1 |
p | 6 | script | 1 | style | 1 | html | 1 |
meta | 5 | title | 1 | footer | 1 | ||
tr | 4 | img | 1 | body | 1 | ||
a | 4 | link | 1 | ul | 1 |
Type | Description |
---|---|
Image | PNG logo |
Style Sheets | Bootstrap CSS |
JavaScript | jQuery JS library |
HTML | Document HTML | Chained Document HTML | ||
---|---|---|---|---|
HTML compatibility | Incompatible | Compatible | Compatible | Compatible |
Structure | Single file with internal resources | Multiple files with external resources | Single file with internal resources | Single file with internal resources |
Resources format | Any resources on PDF specification | Any resources on HTML specification | Any resources with Data URI scheme | Any resources with Data URI scheme |
HTML tags | Nonsupport | Any HTML tags under HTML specification | Don’t allow to use iframe, object, video, audio | Don’t allow to use iframe, object, video, audio |
Data loading by scripting | Allow | Allow | Don’t Allow | Don’t Allow |
Metadata in the document | XMP, Pieceinfo | - | Content digest, Signed digest, Certificate | Content digest, Signed digest, Transaction address |
Metadata in the blockchain | - | - | - | Timestamp Creator Content digest Certificate |
User experience on a mobile | Bad readability, No interaction | Good readability, Responsive | Good readability, Responsive | Good readability, Responsive |
Verification content integrity | Verification by a digital signature | Nonsupport | Verification by document HTML metadata | Verification by chained document HTML metadata |
Verification content delivery | Nonsupport | Nonsupport | Nonsupport | Verification by chained document HTML blockchain metadata |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hwang, H.-C.; Kim, W.-J. Design of Chained Document HTML Generation Technique Based on Blockchain for Trusted Document Communication. Electronics 2022, 11, 1006. https://doi.org/10.3390/electronics11071006
Hwang H-C, Kim W-J. Design of Chained Document HTML Generation Technique Based on Blockchain for Trusted Document Communication. Electronics. 2022; 11(7):1006. https://doi.org/10.3390/electronics11071006
Chicago/Turabian StyleHwang, Hyun-Cheon, and Woo-Je Kim. 2022. "Design of Chained Document HTML Generation Technique Based on Blockchain for Trusted Document Communication" Electronics 11, no. 7: 1006. https://doi.org/10.3390/electronics11071006
APA StyleHwang, H.-C., & Kim, W.-J. (2022). Design of Chained Document HTML Generation Technique Based on Blockchain for Trusted Document Communication. Electronics, 11(7), 1006. https://doi.org/10.3390/electronics11071006