Design of Enhanced Document HTML and the Reliable Electronic Document Distribution Service
Abstract
1. Introduction
2. Related Research
2.1. PDF
2.2. HTML
2.3. Content Integrity
2.4. Digital Signature
2.5. Document HTML and Chained Document
3. Enhanced Document HTML
3.1. Improvement of Document HTML
3.2. Definition of Extended Document HTML
- (a) Extended Document HTML must have a DOCTYPE declaration, as shown in Figure 7;
- (b) Extended Document HTML uses UTF-8 encoding;
- (c) All resources must be embedded; external resources are not allowed. The data URL scheme defined in RFC 2397 is used to convert external resources into internal resources, as shown in Figure 8;
- (d) The CSS @import rule is not allowed. @import can link to an external style sheet, which compromises content integrity;
- (e) Scripting, such as JavaScript, is not allowed. A script can load data from an external source, which compromises content integrity;
- (f) Multimedia tags, such as <audio> and <video>, are not allowed. They are not essential for presenting a document, and they can inflate the file size;
- (g) The <iframe> tag is not allowed. An <iframe> links to content at an external location that is not part of the document, which compromises content integrity;
- (h) External resource containers, such as <object>, <embed>, and <param>, are not allowed. These tags link to non-HTML objects and carry device- or OS-dependent values, which makes the objects difficult to embed;
- (i) Extended Document HTML must include meta tags that support content integrity verification, as shown in Figure 5. The <ds-range> tag indicates the byte range of the Document HTML whose integrity is protected. The <ds-digest> tag holds the message digest computed over that range with a hash function. The <ds-signed-digest> tag holds the <ds-digest> value signed with a PKI certificate. The <ds-cert> tag holds the public key and certificate information needed to verify the <ds-signed-digest> value.
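Rules (a) through (h) lend themselves to a mechanical check. The following is a minimal sketch using only the Python standard library, not the authors' implementation. The names `ConformanceChecker` and `check_document` are hypothetical, and the sketch assumes that a resource reference is any `src` attribute or the `href` of a `<link>` tag. Rule (i) is not covered here.

```python
import re
from html.parser import HTMLParser

# Tags disallowed by rules (e) through (h) in the list above.
FORBIDDEN_TAGS = {"script", "audio", "video", "iframe", "object", "embed", "param"}


class ConformanceChecker(HTMLParser):
    """Hypothetical helper that records rule violations while parsing."""

    def __init__(self):
        super().__init__()
        self.has_doctype = False
        self.in_style = False
        self.violations = []

    def handle_decl(self, decl):
        # Rule (a): a DOCTYPE declaration must be present.
        if decl.lower().startswith("doctype"):
            self.has_doctype = True

    def handle_starttag(self, tag, attrs):
        if tag in FORBIDDEN_TAGS:
            self.violations.append(f"forbidden tag <{tag}>")
        if tag == "style":
            self.in_style = True
        for name, value in attrs:
            # Assumption: src on any tag and href on <link> reference resources.
            is_resource = name == "src" or (tag == "link" and name == "href")
            if is_resource and value and not value.startswith("data:"):
                self.violations.append(f"external resource in <{tag} {name}>")

    def handle_endtag(self, tag):
        if tag == "style":
            self.in_style = False

    def handle_data(self, data):
        # Rule (d): @import inside embedded CSS is not allowed.
        if self.in_style and re.search(r"@import\b", data, re.IGNORECASE):
            self.violations.append("@import in CSS")


def check_document(raw: bytes) -> list:
    """Return a list of violations; an empty list means no rule was broken."""
    try:
        text = raw.decode("utf-8")  # Rule (b): UTF-8 encoding.
    except UnicodeDecodeError:
        return ["not valid UTF-8"]
    checker = ConformanceChecker()
    checker.feed(text)
    checker.close()
    violations = list(checker.violations)
    if not checker.has_doctype:
        violations.insert(0, "missing DOCTYPE")
    return violations
```

A document that passes yields an empty list, so the function can act as a gate before the digest and signature are generated.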
4. Design of Electronic Document Distribution Service Based on Enhanced Document HTML
4.1. Certified Electronic Document Intermediary
4.2. Electronic Document Distribution with Document HTML
5. Experimental Verification
5.1. HTML Electronic Document for Experiment
5.2. Generation of Document HTML
5.3. Verification of Document HTML
6. Discussion and Limitations
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Type | Purpose | Key Characteristic |
---|---|---|
PDF/A | Long-term archiving | |
PDF/X | Digital print | |
PDF/UA | Accessibility for visually impaired | |
Conformance | Description |
---|---|
Document encoding | UTF-8 |
Resources | All resources must be embedded so that the document is a single file, using the data URL scheme defined in RFC 2397. |
Multimedia tags | Multimedia tags, such as <audio> and <video>, are not allowed because they are not essential to a document and can inflate the file size. |
External resources container tags | <iframe>, <object>, <embed>, and <param> are not allowed because they bring in content from an external location, which compromises content integrity. |
Asynchronous data loading | Asynchronous data loading through scripts is not allowed because it can change the content, which compromises content integrity. |
Digital signature | The content must be digitally signed using a PKI certificate. |
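The Resources row relies on RFC 2397 data URLs of the form `data:<mediatype>;base64,<data>`. The sketch below shows one way an external image reference could be turned into an internal one. The helper names `to_data_url` and `embed_image` are hypothetical, and the plain string replacement assumes that the `src` attribute appears with double quotes exactly as written.

```python
import base64


def to_data_url(payload: bytes, mime_type: str) -> str:
    """Encode raw resource bytes as a base64 data URL (RFC 2397)."""
    encoded = base64.b64encode(payload).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def embed_image(html_text: str, external_url: str, payload: bytes, mime_type: str) -> str:
    """Replace one external src reference with an embedded data URL.

    Assumption: the reference is written as src="<external_url>".
    """
    data_url = to_data_url(payload, mime_type)
    return html_text.replace(f'src="{external_url}"', f'src="{data_url}"')


if __name__ == "__main__":
    page = '<img src="/img/cm/tmpl/img_tel.png"/>'
    # The payload here is placeholder bytes, not a real PNG file.
    print(embed_image(page, "/img/cm/tmpl/img_tel.png", b"hello", "image/png"))
```

Base64 encodes every 3 bytes as 4 characters, so each embedded resource grows by roughly one third.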
tag | "ds-range" |
content | "Byte Position" |
content format | "[0-9a-f]{8}[0-9a-f]{8}[0-9a-f]{8}[0-9a-f]{8}" |
example | <meta name="ds-range" content="00000000000000FF00000AFF00000BFF"/> |
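The ds-range content packs four 8-digit hexadecimal fields into one string. The sketch below parses such a value and computes a digest over the bytes it covers. Several points are assumptions rather than statements from the source: the four fields are read as two (start, end) byte-position pairs with exclusive ends, SHA-256 stands in for the unspecified hash function, and uppercase hex digits are accepted because the example value uses them. The function names are hypothetical.

```python
import hashlib
import re

# Accept both cases: the format row lists lowercase, the example uses uppercase.
DS_RANGE_RE = re.compile(r"[0-9a-fA-F]{32}")


def parse_ds_range(content: str) -> list:
    """Split a ds-range value into four integer byte positions."""
    if not DS_RANGE_RE.fullmatch(content):
        raise ValueError("ds-range must be exactly 32 hexadecimal digits")
    return [int(content[i:i + 8], 16) for i in range(0, 32, 8)]


def digest_over_range(document: bytes, content: str) -> str:
    """Hash the two byte ranges named by ds-range.

    Assumptions: fields are (start1, end1, start2, end2) with exclusive
    ends, and the hash function is SHA-256.
    """
    start1, end1, start2, end2 = parse_ds_range(content)
    hasher = hashlib.sha256()
    hasher.update(document[start1:end1])
    hasher.update(document[start2:end2])
    return hasher.hexdigest()


if __name__ == "__main__":
    print(parse_ds_range("00000000000000FF00000AFF00000BFF"))
```

Under this reading, bytes between the two ranges are left out of the digest, which would leave room for the signature meta tags themselves to be written without invalidating the hash.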
URL | Type | Description |
---|---|---|
/jsonAction.do?actionId=action_id&tx_id=tx_id&token=tokenID | text/html | Main HTML |
/img/rn/tmpl/10/2021/20220420_dt06.png | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_dt05.png | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_dt04.png | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_dt03.png | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_dt01.png | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_bot.jpg | image/png | Common image |
/img/rn/tmpl/10/2021/20220420_at02.png | image/png | Common image |
/img/cm/tmpl/img_tel.png | image/png | Common image |
/img/cm/css/btn/btn_top.png | image/png | Common image |
/img/cm/css/bg/ico_phone.png | image/png | Common image |
/js/comm/jquery/jquery-ui.css | text/css | Style Sheet |
/js/comm/iSwiper/swiper.min.css | text/css | Style Sheet |
/css/comm/ntstb/styleTb.css | text/css | Style Sheet |
/css/comm/ntstb/commonTb.css | text/css | Style Sheet |
/css/comm/NtsCommonTb.css | text/css | Style Sheet |
/css/comm/cm_style.css | text/css | Style Sheet |
Tag | Count | Tag | Count | Tag | Count | Tag | Count |
---|---|---|---|---|---|---|---|
p | 81 | col | 20 | caption | 8 | title | 1 |
td | 79 | br | 19 | colgroup | 7 | link | 1 |
tr | 61 | strong | 15 | dt | 4 | html | 1 |
div | 58 | ul | 10 | dd | 4 | head | 1 |
th | 46 | tbody | 8 | thead | 3 | em | 1 |
li | 37 | table | 8 | meta | 3 | button | 1 |
span | 21 | img | 8 | dl | 3 | body | 1 |
Item | Result |
---|---|
External resources converted to internal resources | 11 resources (1 style sheet and 10 images) |
File size of the original tax notification document | 1,833,435 bytes |
File size of the tax notification document in image format for download | 2,414,571 bytes |
File size of the Document HTML | 2,462,018 bytes |
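The file sizes in the table above imply the following differences. This is plain arithmetic on the reported values. Reading the first ratio as being close to the 4/3 expansion of base64 encoding is only an observation, not a claim made in the source.

```latex
\begin{align*}
\Delta_{\text{original}} &= 2{,}462{,}018 - 1{,}833{,}435 = 628{,}583 \text{ bytes},
& \frac{628{,}583}{1{,}833{,}435} &\approx 34.3\% \\
\Delta_{\text{image}} &= 2{,}462{,}018 - 2{,}414{,}571 = 47{,}447 \text{ bytes},
& \frac{47{,}447}{2{,}414{,}571} &\approx 2.0\%
\end{align*}
```

The Document HTML is therefore about one third larger than the original document and about 2% larger than the image version offered for download.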
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Hwang, H.-C.; Kim, W.-J. Design of Enhanced Document HTML and the Reliable Electronic Document Distribution Service. Electronics 2023, 12, 2176. https://doi.org/10.3390/electronics12102176