We Can Define the Domain of Information Online and Thus Globally Uniformly
Abstract
:1. Introduction
2. Basics and Recommended Method for Global Definition of Digital Information
- Digital bits first encode numbers (Figure 1). The software must know the binary format of the numbers.
- Each number represents a selection from an ordered set or “domain” (Figure 1).
- A sequence of multiple numbers selects from multiple domains or a “multidimensional” domain (which is ordered along multiple dimensions). It is the domain of the numbers and so the domain of this piece of digital information.
- The elements of the domain can represent very different things, e.g., control codes for software, letters, indices, e.g., on entries of a table, of course also measurable values (Figure 2) and other data.
- (a)
- The original source of information (e.g., observation, measurement) should be reproducible.
- (b)
- We create a list of all independent, reproducible features (e.g., measurements, properties) that could be of interest to this application.
- (c)
- We try to search, reference, and reuse existing online definitions for quantifying each listed feature. If no suitable online definition is available, we must create a new online definition of our own. This definition can use and describe the natural order of the feature and select a natural unit, or it can define a new order of feature variants so that they are reproducibly (one-to-one resp. bijectively) mapped to the domain of a number (quantification). Then, they are selectable by the number (which digitally represents the feature). The defined order of application-specific features can later be automatically used for similarity comparison and search by universal numerical search engines [14].
- (d)
- The digital representations (numbers) of all listed features are uniquely combined and nested to form the number sequence that digitally conveys the information for that application.
- (e)
- All necessary definitions (of the features, of the representing numbers, of their combination, of the digital format of each number) are put online in standardized machine-readable form. The online definition starts with a version number of the standard and an efficient machine-readable specification of the binary format of the number sequence. This includes short links to the exact definitions of each number. The exact definition may be very detailed and may also include links to multilingual and multimedia explanations and to (the source code of) software for handling the number sequence. Online definitions can be reused and nested in subsequent online definitions. This enables the definition and handling of complex data structures.
- (f)
- After the online definition, the defined information can be reproducibly digitized and transported as DV in the form (1): “UL plus number sequence”, where UL efficiently refers to the online definition.
3. Example of Results
4. Discussion
4.1. Short Conclusive Argumentation
- Information (e.g., digital information) is a selection (2) from an ordered set. I have called this set the “domain” of information. It is clear that numbers are selections from ordered sets and that digital bits encode sequences of numbers.
- Therefore, for a direct comparison (e.g., to check similarities in a similarity search), the domain (2) of the compared pieces of digital information (number sequences) should be the same.
- Therefore, it is consequent to consider number sequences as building blocks of digital information and to define the domain of a number sequence and its binary format globally. In the current infrastructure, this can be performed efficiently by a machine-readable online definition of digital information (number sequence).
- The proposed DV data structure (1) simply represents such a number sequence, preceded by the UL, which is required as a global identifier and efficient pointer to the online definition. To save energy and resources, the binary format of the DV (UL and number sequence) is optimized for efficiency.
- Details about a possible format of the UL were published [12]. Additional data are not necessary for the DV and were avoided. This is relevant for both efficiency and global uniqueness. In contrast, the (globally unique) online definition of the DV may contain efficient links to additional detailed descriptions and explanations.
4.2. More Detailed Discussion
4.3. The Domain of Information Is Essential
4.4. About Existing Concepts of Information—Rethinking Is Necessary
4.5. Disadvantages of the New DV Data Structure
- Today still unfamiliar even for experts
- Initial effort
- Online definition is unusual up to now and easy to underestimate
- Large project which initially requires investment
4.6. Advantages of the New DV Data Structure
- Global digital information including definition
How is it ensured globally that every communication participant with software knows the format and definition of the exchanged digital information (number sequence)?
- Global digital information including identification
- Globally reproducible and precise digital information
- Globally comparable and searchable digital information
- Global digital information representation optimized for the application
- Global language-independent information transport
- Digital information for global AI and machine learning
- Digital information for global programming
- Digital information for efficient global work-sharing
- Digital information optimized for efficiency
- Rethinking (digital) information
- Semantics (combination of digital information) are better reproducible
- Improving international communication
4.7. This Has an Impact on My World View
5. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Orthuber, W.; Fiedler, G.; Kattan, M.; Sommer, T.; Fischer-Brandies, H. Design of a global medical database which is searchable by human diagnostic patterns. Open Med. Inform. J. 2008, 2, 21–31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Orthuber, W.; Sommer, T. A Searchable Patient Record Database for Decision Support. Medical Informatics in a United and Healthy Europe. In Proceedings of the MIE 2009, The XXIInd International Congress of the European Federation for Medical Informatics, Sarajevo, Bosnia and Herzegovina, 30 August–2 September 2009; pp. 584–588. [Google Scholar]
- Orthuber, W.; Dietze, S. Towards Standardized Vectorial Resource Descriptors on the Web. In Proceedings of the Informatik 2010: Service Science—Neue Perspektiven für die Informatik, Beiträge der 40. Jahrestagung der Gesellschaft für Informatik e.V. (GI), Band 2, Leipzig, Germany, 27 September–1 October 2010; pp. 453–458. [Google Scholar]
- Orthuber, W.; Papavramidis, E. Standardized vectorial representation of medical data in patient records. Med. Care Compunetics 2010, 6, 153–166. [Google Scholar]
- Orthuber, W. Collection of Medical Original Data with Search Engine for Decision Support. In Exploring Complexity in Health: An Interdisciplinary Systems Approach; IOS Press: Amsterdam, The Netherlands, 2016; pp. 257–261. [Google Scholar]
- Orthuber, W.; Hasselbring, W. Proposal for a New Basic Information Carrier on the Internet: URL Plus Number Sequence. In Proceedings of the 15th International Conference WWW/Internet, Mannheim, Germany, 28–30 October 2016; pp. 279–284. [Google Scholar]
- Orthuber, W. Online definition of comparable and searchable medical information. Digit. Med. 2018, 4, 77–83. [Google Scholar] [CrossRef]
- Orthuber, W. Global predefinition of digital information. Digit. Med. 2018, 4, 148–156. [Google Scholar] [CrossRef]
- Orthuber, W. Information Is Selection-A Review of Basics Shows Substantial Potential for Improvement of Digital Information Representation. Int. J. Environ. Res. Public Health 2020, 17, 2975. [Google Scholar] [CrossRef] [PubMed]
- Orthuber, W. How to make medical information comparable and searchable. Digit. Med. 2020, 6, 1–8. [Google Scholar] [CrossRef]
- Orthuber, W. Internet since decades—but no globally searchable data. GJETA 2020, 4, 70–72. [Google Scholar]
- Orthuber, W. Reproducible Transport of Information. In Studies in Health Technology and Informatics; IOS Press: Amsterdam, The Netherlands, 2021; Volume 281, pp. 3–7. [Google Scholar]
- Orthuber, W. The Building Blocks of Information Are Selections–Let’s Define Them Globally! In pHealth; IOS Press: Amsterdam, The Netherlands, 2021; pp. 76–81. [Google Scholar]
- Orthuber, W. Demonstration of Numeric Search in User Defined Data. Available online: http://www.numericsearch.com (accessed on 10 March 2022).
- Lee, D.; Chu, W.W. Comparative Analysis of Six XML Schema Languages. ACM Sigmod Rec. 2000, 29, 76–87. [Google Scholar] [CrossRef] [Green Version]
- Tekli, J.; Chbeir, R.; Yetongnon, K. An overview on XML similarity: Background, current trends and future directions. Comput. Sci. Rev. 2009, 3, 151–173. [Google Scholar] [CrossRef]
- World Wide Web Consortium. W3C: Resource Description Framework, W3C Recommendation. 2004. Available online: http://www.w3.org/RDF/ (accessed on 29 March 2022).
- Murray-Rust, P. Open data in science. Nat. Preced. 2008, 1. Available online: https://www.nature.com/articles/npre.2008.1526.1 (accessed on 29 March 2022).
- Dirac, P.A.M. The Principles of Quantum Mechanics (No. 27); Oxford University Press: Oxford, UK, 1981. [Google Scholar]
- Zezula, P.; Amato, G.; Dohnal, V.; Batko, M. Similarity Search: The Metric Space Approach; Springer Science & Business Media: New York, NY, USA, 2006; Volume 32. [Google Scholar]
- Aliprand, J.M. The unicode standard. Libr. Resour. Tech. Serv. 2011, 44, 160–167. [Google Scholar] [CrossRef] [Green Version]
- Henderson, T.; Shepheard, J.; Sundararajan, V. Quality of diagnosis and procedure coding in ICD-10 administrative data. Med. Care 2006, 44, 1011–1019. [Google Scholar] [CrossRef] [PubMed]
- Southern, D.; Eastwood, C.; Quan, H.; Ghali, W. Enhancing description of hospital-conditions with ICD-11 cluster coding: Better codes for monitoring and prevention. Int. J. Popul. Data Sci. 2018, 3, 4. [Google Scholar] [CrossRef]
- Benson, T.; Grieve, G. Principles of Health Interoperability: SNOMED CT, HL7 and FHIR; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
- Saripalle, R.; Runyan, C.; Russell, M. Using Hl7 FHIR to achieve interoperability in patient health record. J. Biomed. Inform. 2019, 94, 103188. [Google Scholar] [CrossRef] [PubMed]
- Josefsson, S. The Base16, Base32, and Base64 Data Encodings. Available online: https://www.hjp.at/doc/rfc/rfc4648.html (accessed on 11 March 2022).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Orthuber, W. We Can Define the Domain of Information Online and Thus Globally Uniformly. Information 2022, 13, 256. https://doi.org/10.3390/info13050256
Orthuber W. We Can Define the Domain of Information Online and Thus Globally Uniformly. Information. 2022; 13(5):256. https://doi.org/10.3390/info13050256
Chicago/Turabian StyleOrthuber, Wolfgang. 2022. "We Can Define the Domain of Information Online and Thus Globally Uniformly" Information 13, no. 5: 256. https://doi.org/10.3390/info13050256
APA StyleOrthuber, W. (2022). We Can Define the Domain of Information Online and Thus Globally Uniformly. Information, 13(5), 256. https://doi.org/10.3390/info13050256