Testing and Validating the Suitability of Geospatially Informed Proxies on Land Tenure in North Korea for Korean (Re-)Uniﬁcation

: The role of remote sensing data in detecting, estimating, and monitoring socioeconomic status (SES) such as quality of life dimensions and sustainable development prospects has received increased attention. Geospatial data has emerged as powerful source of information for enabling both socio-technical assessment and socio-legal analysis in land administration domain. In the context of Korean (re-)uniﬁcation, there is a notable paucity of evidence how to identify unknowns in North Korea. The main challenge is the lack of complete and adequate information when it comes to clarifying unknown land tenure relations and land governance arrangements. Deriving informative land tenure relations from geospatial data in line with socio-economic land attributes is currently the most innovative approach. In-close and in-depth investigations of validating the suitability of a set of geospatially informed proxies combining multiple values were taken into consideration, as were the forms of knowledge co-production. Thus, the primary aim is to provide empirical evidence of whether proposed proxies are scientiﬁcally valid, policy-relevant, and socially robust. We revealed differences in the distributions of agreements relating to land ownership and land transfer rights identiﬁcation among scientists, bureaucrats, and stakeholders. Moreover, we were able to measure intrinsic, contextual, representational, and accessibility attributes of information quality regarding the associations between earth observation (EO) data and land tenure relations in North Korea from a number of different viewpoints. This paper offers valuable insights into new techniques for validating suitability of EO data proxies in the land administration domain off the reliance on conventional practices formed and customized to the speciﬁc artefacts and guidelines of the remote sensing community.


Introduction
The role of remote sensing data in detecting, estimating, and monitoring socioeconomic status (SES) such as quality of life dimensions and sustainable development prospects has received increased attention across number of disciplines in recent years [1][2][3][4][5]. Geospatial thinking and technology have provided an important opportunity to advance the understanding of the specific questions which drives SES perspectives that include humanitarian health [6,7], rural household poverty [4,8], neighborhood deprivation [9], valuation of land and property [10], and urban dynamics [3]. This argument has given rise to much debate on not only making an association of remotely sensed spatial data and socio-economic parameters but also predicting or interpreting them.
Land administration deals with the people-to-land relationship, by describing, analyzing, designing, and measuring their relations that include social, economic, spatial, legal, and engineering perspectives [11]. Along with the growth of remote sensing applications in SES, there has been growing recognition and evidence of the vital links between In general, gathering information regarding land tenure from multiple non-human resources such as legislation, policy documents and case studies is possible. However, the main challenge faced by many decision-makers is the incompleteness and inadequacy (inadequate proof or evidence) of information when it comes to clarifying how land tenure relations and land governance arrangements are really constructed and maintained in North Korea. In the light of these unknowns, additional data from multiple stakeholders is valuable for obtaining a more detailed insight into the broad spectrum of personal experience, views, and judgements [43,44]. In this respect, North Korean refugees can also act as human capital, not only as a supplement to publications on North Korea but also as a way of conducting an empirical analysis of primary data on North Korea [45].
When it comes to abnormal circumstances for gathering empirical evidence in this study, it is necessary to explain the main challenges and most common practices. More recent attention in urban planning and land management of North Korea has focused on using focus group interview (FGI) methods with North Korean refugees and spatial analysis with Google Earth images (EO data). Notwithstanding the fact that these methods seem to be most feasible and effective, the current research on land tenure and land governance in North Korea has been impeded by the lack of empirical data, rigorous methodologies, and reliability and validity of information for the in-close and in-depth investigation. Therefore, in addition to methods commonly used, such a content analysis on internal documents in North Korea and joint expert consultation processes make up for a dearth of evidence-based knowledge base [42,46].
Adopting a similar position aforementioned, findings from a previous study demonstrate how a process of socializing the pixel (combining with land administrative and geospatial engineering approaches) can take place through (re-)interpreted semantic land tenure relations [21]. As a result, GIA has been proposed based on a mixed-methods design and an information fusion approach, to construct a strong and consistent association between land tenure and EO data. However, a further investigation into the validation of elaborated meaning and interpreted information throughout extensive consultations with outside experts and multiple stakeholders needs to be undertaken before the association between land tenure and EO data can be more clearly understood in line with algorithmic approaches. Thus, the primary aim of this paper is to provide empirical evidence for the claim that it is possible to standardize the identification and categorization of certain objects, environments, and semantics visible in EO data that can be used to (re-)interpret land tenure relations.
Given our awareness of critical consequences in misidentifying geospatially informed proxies due to the lack of appropriate and rigorous validation of suitability, a threefold approach to empirical knowledge elicitation for GIA is taken, comprising (1) extracting scientific knowledge with a high-level of expertise based on topical (i.e., land tenure and land administration), methodological (i.e., remote sensing and earth observation), and contextual (i.e., Korean (re-)unification) interests; and (2) identifying bureaucratic knowledge (i.e., government officials); and (3) deriving knowledge of local communities in geographic areas of interest from people (i.e., North Korean refugees) who have the most accurate understanding (through familiarity or personal experience) of land tenure relations, land governance and land use practices. This study seeks to answer the following specific research questions:

•
To what extent does scientific, bureaucratic and stakeholder knowledge agree or disagree with a set of identified pixel-based proxies related to land tenure in North Korea? • How does a knowledge co-production process help to validate suitability of geospatially informed proxies and become legitimate land tenure knowledge?
This paper comprises four sections. The first deals with the conceptual and methodological accounts of research and analysis. The following section brings together the key findings relating to proxy identification and the measurement of information quality. The remaining sections of the paper comprises a summary and discussion of the findings and further implications for future research, respectively.

Validating the Suitability of Geospatially Informed Proxies
Validating the suitability of the remote sensing data and products against the social context of the location is critical. However, despite the explosive growth in the use of remote sensing in a wide range of applications in many different fields, there is increasing concern about the lack of rigorous social and contextual validation methods and techniques, which, in turn, may result in the misidentification or misinterpretation of proxies. Therefore, to ensure better-informed geospatial analysis, it is necessary to consider not only the procedure and legitimacy of validating the suitability of relevant proxies and combining multiple values through knowledge co-production processes.

On the Need of Tailored Approaches to EO Data Validation
"Validation" is a term frequently used in the remote sensing literature, yet it is used in different disciplines to mean different things. To avoid, terminological confusion, it is important to bear in mind that the term validation throughout this paper has come to be used in its broadest sense to refer to validating suitability or usability of proxies using EO data for GIA. A validation is a fundamental requirement when using EO data in any mapping project. It provides a basis for identifying classification errors and enables the overall accuracy and uncertainty of mapping outcomes to be estimated with sample data [47]. Congalton [48] lists three reasons why validation has become so important. It enables (1) the identification and correction of usage errors in images, (2) a robust quantitative comparison of methods, and (3) the provision of more reliable information to enable better informed decision making. Much of the available literature on remote sensing deals with the question of accuracy; however, Campbell and Wynne [49] critically warn that validation is a much more complex process, as considered by many, and it displays obvious difficulties in convincingly addressing whether the outcomes are correct. The design of sampling, response, and analysis processes are an important component of accuracy assessment and play a key role in yielding rigorously defensible validation in remote sensing science [47]. As conventional methods, there are several possible validation techniques available to examine the accuracy or error of EO data, such as visual inspection, non-site-specific analysis, difference image creation, error budgeting and quantitative accuracy assessment [48,49]. Indeed, the renewed interest in various image classification methods, such as geographic object-based image analysis (GEOBIA), necessitates a range of different validation efforts to meet the respective characteristics [50].
Despite the cutting-edge advancements in EO data validation, it should be noted that remote sensors on satellites and aircraft cannot directly detect and record a particular social, political, economic, or historical context of landscapes and their internal dynamics [51]. Unlike the remote sensing community, some studies confirmed that including participatory techniques with professionals in land administration domain helps to validate quality and usability [52,53]. Moreover, a qualitative approach in conjunction with visual interpretation and quantitative analysis to measure the scalability of the semi-automated cadastral boundary feature extraction from remote sensing data has been applied to the validation [54]. In the same vein, using EO data to derive proxies for identifying and interpreting unknown land tenure relations requires a rigorous interpretation of various contextual information and a more nuanced insight into the socio-legal-spatial properties. Therefore, there are limits to how far conventional and solid validation procedures, which have already been formed and customized for specific artefacts and validation objectives in the remote sensing community [55], can be taken in land administration science. In other words, a tailored validation protocol would help to establish a higher accuracy and feasibility based on the results obtained from the EO data interpretation.

Knowledge Co-Production: Scientific, Bureaucratic and Stakeholder Knowledge
The notion of "knowledge co-production" is commonly used to refer to the collaborative and interactive process of synthesizing different sources and types of knowledge [56][57][58]. Co-produced knowledge blurs the boundaries between science and practice [59]. Therefore, not only experts, scientists and professionals now play a pivotal role in the decision-making process, but the committed knowledge of non-scientific stakeholders should also be taken into account.
According to Freedman [60], if scientifically valid trials of a useful or interesting hypothesis are conducted or provide reliable information on the hypothesis being tested, the values and validity are recognized as scientific. To enable this, scientific validation needs to consider the inclusion of expert knowledge at higher levels of education and professionalism in order to test for scientific acceptability such as transparency and replicability of results [61]. Bureaucratic knowledge is now considered essential, as bureaucrats and civil servants possess advanced knowledge from governance processes that include top-down political representation, bottom-up citizen participation and informal knowledge-sharing networks [62]. In administrative and government practice, "bureaucratic knowledge" serves to navigate complex decision making. It associates the political and strategic use of knowledge rather than the intrinsic (non-instrumental) value of knowledge, such as the norms of ethos and ethics, with bureaucratic works [57,[62][63][64]. Thus, bureaucratic validation is such that it synthesizes knowledge from both internal and external resources and creates new forms of knowledge from perceptions of political feasibility and institutional arrangement [65]. Unlike scientific validation, which is based on replication logic, bureaucratic knowledge relies on either pragmatic plausibility or feasibility logic, similar to political logic.
However, critical questions have been raised about the uncertainty of decision making in resolving multifaceted local and societal problems on the sole basis of scientific and expert knowledge [66]. One of the most significant current discussions in this argument concerns the incorporation of varying stakeholder knowledge that reduces rigidity, represents multiple perspectives, and promotes adaptability in decision making [67]. There is also a growing body of literature that recognizes the importance of stakeholder or lay knowledge as a key informant that emphasizes intense contextual and localized knowledge of people in their local environments [68]. Therefore, the potential advantage of using stakeholder validation is the increased precision both in terms of context and localization in validating the suitability of proxies that cannot be verified through disciplinary expert assessment or administrative capacity. In view of all that has been mentioned so far, one may suppose, as argued by Edelenbos et al., [57] that only coproduced knowledge fully assesses pre-identified proxies for GIA that considers scientific validity, policy relevance and social robustness.

Geospatially Informed Analysis (GIA)
In recent decades, describing, analyzing, and understanding people-to-land relations using geospatial technology has given rise to effective legal, social, and spatial solutions to multifaceted problems relating to land. Recently, more advanced geospatial intelligence has not only offered administratively straightforward, technically feasible, and financially affordable approaches [69], but it has also provided a rich set of data and information that conventional analytical techniques would have been unable to identify or access. We also view land management as a combination of interventions in governance, based on questions of how and under what conditions such land interventions are responsible and how these can be supported by technologies. It is possible, therefore, that GIA supports both smart and responsible land management [70,71], especially of difficult-to-access regions where unknown or unsupported land governance exists [21,39,40].
On the one hand, geospatial intelligence is not currently sufficiently embedded in decision-making processes, while on the other, decision makers do not sufficiently rely on geospatial intelligence, even though it is available. Even geospatial intelligence is too product-oriented and insufficiently process-oriented. Based on this line of argument, we note that, despite the above claims of GIA, it still needs to be clarified how, where, and when it can be used to enrich both scientific and bureaucratic knowledge. Building on the critical insights from GIA, it enables us to address proxy development in a smart and responsible manner, where significant uncertainty exists regarding data access, data integration and data reliability.
GIA is fast becoming the ultimate driver of spatial decision making in land management and sheds new light on recent insights into societies, the environment, the earth, resilience, and sustainable development. However, scientifically framed knowledge and technical expertise in remote sensing and earth observation tends to greatly exaggerate the excellence of laboratory experiments conducted under highly controlled conditions and technocratic approaches to dealing with land/spatial problems. At the same time, the dominance of bureaucratic knowledge in land policymaking devalues other forms of knowledge and undermines the local context, the political representation of citizens and the social processes of land governance and land use. Using geospatial tools and instruments, the citizen (stakeholder) is now able not only to consume and produce geospatial information but also to contribute grounded knowledge more effectively to spatial decision-making processes. Synchronization, complementing and contradiction with views, judgements and experiences in the knowledge coproduction process affect the GIA utilized in spatial decision making and thus determine exactly how, where and when different forms of knowledge can legitimize scientific standards and conformity as bureaucratic and social norms.

A Case Study: Geospatially Informed Analysis of North Korea
The method was designed for which proxies are considered relevant and useful by scientists, government professionals and stakeholders and to elicit their evaluation of the information quality. This necessitated very careful investigation. A survey was conducted with 77 sample respondents recruited from scientific, bureaucratic and stakeholder groups. Data for this study were collected using a web-based questionnaire, and the analysis used both the Chi-square test and the one-way ANOVA test. The following subsections describe in greater detail what was investigated and who was involved, how the survey was conducted and how the data was analyzed.

Identification of Proxies and Quality of Information
The first important stage of the analysis was to identify proxies by which to derive unknown land tenure relations in North Korea in conjunction with EO data. A preliminary investigation proposed a set of candidate proxies relating to the key questions based on the elements of image interpretation used in remote sensing, divided into the four categories of land ownership, land use, land transfer, and land access [21]. Within these categories, a total of 66 proxies were derived from 32 groups of objects, environments and semantics that were visible in the EO data and that could be (re-)interpreted to discern unknown land tenure relations in North Korea (see Figure 1). These proxies generally consisted of combinations of shape patterns, colors, textures relating to physical structures, types of buildings, infrastructures, types of land use, and proximity of comparable features. The line of reasoning attached to the proxies is significantly associated with central concepts of tenure claims and interests such as collective ownership, land lease and use, occupation, transactions, and land access. Hence, in line with our approach to validating the suitability of remotely sensed proxies, we set out to test the hypothesis that determines "whether proposed proxies are (1) scientifically valid, (2) administratively relevant or useful, and (3) contextualized and localized." Hence, the null hypothesis (H 0 ) is "no difference between scientific, bureaucratic, and stakeholder distributions (of agreements) for identified proxies." One of the most influential accounts of a methodology for information quality assessment comes from [72], which sets out the theoretical dimensions of information quality (IQ) and comprehensively examines four key quality attributes from both academics' and practitioners' perspectives: (1) intrinsic, (2) contextual, (3) representational, and (4) accessible. Both intrinsic and contextual quality underline the informative factors, but intrinsic attributes are associated with accuracy, believability, reputation, and objectivity, whereas the latter considers tasks that require added value, relevance, completeness, timeliness, and an appropriate amount. On the other hand, both representational and accessibility dimensions stress the technical accounts of the system by which information must not only be interpretable, easy to understand and represented clearly and consistently, but also emphasize accessibility and security [72]. To test the hypothesis, two different approaches were taken in an attempt to account for the identification of proxies (Part I of the questionnaire) and the measurement of information (proxy) quality (Part II of the questionnaire).

Participants
The participants (see Table 1) were divided into three groups on the basis of their knowledge production methods: (1) scientific knowledge, focusing in particular on topical, methodological, and contextual interests (scientists); (2) bureaucratic knowledge that exists in the context of administrative and governmental practices; and (3) stakeholder knowledge of key informants with the emphasis on contextual and localized knowledge of North Korea. A random sample of participants (i.e., thus, it is unable to report the participation rate exactly.) with a different set of knowledge was identified from personal networks and connections with government agencies (from Korea Land and the Geospatial Informatix Corporation (LX) as well as local authorities under the Ministry of Land, Infrastructure and Transport (MOLIT)), (non-)governmental organizations (the Korea Hana Foundation, the Together Foundation, and the Saeil Academy), academic and research institutions (from universities, the Korea Research Institute for Human Settlements (KRIHS), the Land & Housing Institute (LHI), and the Spatial Information Research Institute (SIRI)) in South Korea. Therefore, group A and B were recruited based on South Korean's expertise, while group C consisted of North Korean refugees living in South Korea.
A total of 77 participants took part in the study. Of the total cohort of 77 participants, 29 were members of scientific knowledge groups (38%) while 30 and 18 respondents were from bureaucratic (39%) and stakeholder knowledge groups (23%), respectively (see Table 1). The Participants A group comprised scientists representing a broad range of expertise and domains in the fields of land management, land administration, land governance, land tenure, and cadastral surveying. This group also included eligible specialists with substantial knowledge and skills in remote sensing and earth observation technologies. In addition, participants were recruited from independent entities who share knowledge and a deeper understanding of Korean (re-)unification. The Participant B group represented bureaucratic knowledge and policy usefulness, with the following parameters: government professionals and officials who demonstrated a set of professional skills and had gained relevant work experience in public sectors involving land tenure/administration/management, land/cadastral surveying, and geospatial information. To create our stakeholder sample, we considered people with a declared or conceivable interest or stake in land tenure relations, land governance arrangements and land-use practices in North Korea. Thus, the Participants C group involves judgements of stakeholders who have the most direct and accurate understanding of land systems in North Korea by virtue of their life experience; in our context, this refers to North Korean refugees.
To begin this process, each participant group was invited via multiple contact points to participate in the study, with a link to the online questionnaire included. The participants were asked to complete two tasks relating to the identification of proxies and the measurement of information quality. The invitations included a clear explanation of the purpose of the research, along with an introductory statement and instructions attached to the questionnaire. Originally, the questionnaire was compiled in English. However, it was subsequently translated into Korean to gain a better understanding of the possibilities of identifying proxies and measuring information quality. The participants were asked to complete two parts of the anonymized questionnaire within two weeks (29 June-15 July 2020). Note. * For scientific and bureaucratic groups had land administration, management, remote sensing, or unification-related work experiences; on the other hand, it did not require relevant professional knowledge and experiences for stakeholder groups.

Questionnaire
The questionnaire was developed in consultation and discussion with international and local scientific communities (e.g., universities and research institutions), (1) by sharing cutting-edge scientific knowledge on smart and responsible land management; (2) by comparing how local contexts influence land tenure relations, especially in developing countries; and (3) by underpinning a new conceptual and methodological account of a geospatially informed analysis in a remote sensing community.
Due to budget constraints, time-limitations, and travel restrictions (owing to , the data was collected using a web-based questionnaire based on the freely available Google Forms questionnaire. In order to identify the most transferrable and applicable proxies, the participants were asked whether they agreed or disagreed by choosing one of two possible values on a binary scale. One advantage of the binary scale is that it avoids the problem of nuanced and neutral answers from respondents. By forcing respondents' options, we obtained precise data with which to clarify and confirm the proxies identified beforehand (See Figure 1). In addition, after each proxy group category, participants were able to add their comments or suggestions for additional candidate proxies in a supplementary space (see further details of a questionnaire with a following link: https: //forms.gle/8SzK323vYWBhRfhF8; it is available only in Korean.). To provide a good and full understanding of the questionnaire's survey content, the questionnaire was fully described with the background, purpose, and key terms of study, and provided sample satellite images in each section with detailed descriptions. We also amended the question format into a more respondents-friendly form with the help of a communication expert as well as many land administration specialists in South Korea to make it easier for respondents to answer the questions.
Unlike the binary scale format, Likert items allow more finely tuned responses and enable respondents to indicate the extent of their agreement, including a neutral response to the questions. For the attitude questions measuring information quality, a 5-point Likert scale was used to ask respondents whether they agreed or disagreed, with the following possible variations: excellent, good, fair, poor, very poor. Questions on measurement were in part adopted from AIMQ (a methodology for information quality assessment) methodology [67] and referred to such as aspects as believability, completeness, consistent representation, interpretability, objectivity, relevancy, timeliness, and understandability. Finally, the participants were asked to leave an e-mail address if they wished to know the results of the study.

Data Analysis
To formally compare the views and judgements of different group samples in identifying geospatially informed proxies, the Chi-square test was selected to test whether there were any (significant) differences in the distributions across scientific experts, bureaucrats, and stakeholders. We adopted the Chi-square test since this test is also suitable for more than two nominal variables or arbitrary dimension (R × C rather than 2 × 2) [73,74]. Proxies representing only a few statistical differences (p < 0.05) were considered to be in agreement. The experiment was conducted with two possible outcomes (agree or disagree) with the results of the proxy identification (validation) being expressed as a proportion of the overall respondents from the scientific, bureaucratic and stakeholder groups, since our data were derived from random sampling.
To measure information quality, a one-way ANOVA test was conducted. The oneway ANOVA test is one of the most commonly-used techniques for determining whether or not there are any statistically significant differences between the means of two or more independent variables (e.g., between groups, within groups). In an experiment, the measurement variable is the independent variable; thus, scientific, bureaucratic, and stakeholders' standpoints, respectively, were determined. The nominal variable is the dependent variable and can take one of five values (very poor/poor/fair/good/excellent) relating to the quality of the information on proxy identification, based on a 5-point Likert scale. It is equal to 1 if respondents give the answer "very poor" with regard to believability, completeness, consistent representation, interpretability, objectivity, relevancy, timeliness, and understandability. On the other hand, it is equal to 5 when participants consider it to be "excellent". Prior to the one-way ANOVA test, we also conducted D'Agostino-Pearson normality and lognormality tests to determine whether the data set was well-modelled (that the given sample comes from a normally distributed population). We followed up the one-way ANOVA test with Tukey's multiple comparison test (Tukey-Kramer test) to compare every variable with every other variable.

Land Ownership (LO)
To identify land ownership, 15 proxies were incorporated in the analysis (See Figure 1). Of these 15 proxies, first eight (LO. 1 to LO. 8) were associated with the identification of collective (farm)land and the remainder (LO. 9 to LO. 15) with state (farm)land. Null hypothesis (H 0 ) cannot be rejected, i.e., that there is no difference between scientific, bureaucratic and stakeholder distributions for nine proxies. For these nine proxies, the judgements elicited from scientific knowledge are in agreement with those observed in bureaucratic groups. On the other hand, we reject the null hypothesis that there is no difference between the three different knowledge groups for the following six proxies: rough/coarse image texture of (dry)paddy fields, high density/compactness of settlements, signature line of a slanting roof of rural dwellings, observation of seasonal changes in agricultural activities, small dot-shaped patch of orchards, smooth texture of pastures, and low density of building (sites).
It was found that the Table 2 compares the results obtained in the Chi-square test of validating the suitability of proxies for land ownership identification. In general, when a p-value is less than 0.05 for each proxy (No 2, 3, 5, 8 to 10, and 12), it means that the agreements elicited from the scientific, bureaucratic, and stakeholder groups are highly inconsistent, and thus hinder validating the suitability of a set of proxies. Although there is higher rates of disagreement among scientific group (ranging from 13.8 % to 48.3%) arising from the interpretations of the identified proxies, the possible proxies for land ownership identification derived from EO data (LOs. 1, 4, 6, 7, 11, and 13 to 15) achieved a better understanding among the bureaucratic (mean average: 46.2%) and stakeholder groups (mean average: 63.9%). Note. Agreement of knowledge groups means the percentage of yes in the survey. * p value ≤ 0.05; statistically significant between knowledge groups. ** p value ≤ 0.01; statistically very significant between knowledge groups. *** p value ≤ 0.001; statistically extremely significant between knowledge groups.

Land Use Rights (LU)
There were 35 proxies incorporated for the identification of LU from EO data. Proxies LU. 1 to LU. 24 reflect aspects of individual land use rights, while another explanation of group land use right is associated with proxies LU. 25 to LU. 35. Significant associations for the difference between scientific, bureaucratic and stakeholder's agreements were not found to be related throughout a Chi-square test for all the possible proxies of land use rights. In other words, we retain the null hypothesis (p value ≤ 0.05) that there is no difference between the knowledge groups. All participant groups agreed that much uncertainty (judging by the agreement ratio of ≤ 50%) still exists concerning the relationship between EO data and the identification of land use rights at some points. However, statistical difference does not fully account for difference in actual opinions. In other words, the χ 2 and p values only demonstrate a statistically significant difference, which is the result of a rational exercise with numbers but does not denote any practical significance in that there is no difference. Table 3 shows the breakdown of χ 2 and p values along with the fraction of total agreement for validating the suitability of proxies for identifying land use rights. There is still no systematic understanding of how EO data contributes to land use rights identification (LUs. 11 to 15, and 24) among scientific, bureaucratic and stakeholder knowledge groups (less than 30% are in agreement); however, strong evidence was found in support of the validation of the six proxies in individual and seven proxies in group land use rights with agreement of at least 40 percent in two separate groups (LUs. 5, 6, 7, 16, 17, 22, 25 to 27, 29 to 31, and 34). Table 3. Validating the suitability of proxies for land use rights identification and differences between knowledge groups.

Land Transfer Rights (LT)
The proxies used to identify LT had 11 responses to the questions of each knowledge group based on the following key components: small plots (sotoji) divided into a garden plot (GP), side-job plot (SJP), and a tiny patch of land (TPL) in North Korea. As with data obtained in the previous section on land use rights identification, we also found that there is no statistical difference in a set of given observations (p value ≥ 0.05). Therefore, we do not reject the null hypothesis for the difference in views, judgements, and experiences between scientific, bureaucratic, and stakeholder distributions on the proposed proxies for land transfer rights. There remain several aspects concerning small plots (sotoji) about which relatively little is known to scientific and bureaucratic knowledge groups in South Korea (only less than a third (30%) agreed on confirming land transfer rights). However, if we could turn for a moment to look at both Table 4, we can see that the stakeholder group with the most accurate understanding of land tenure relations, land governance and land use practices had a higher mean estimated percentage (32%) of agreement than the average ratio of other groups, with the validation of eleven proxies. This is especially the case with LT. 2 (38.9%), LT. 3 (44.4%), LT. 4 (38.9%), and LT. 6 (44.4%).

Land Access Rights (LA)
As mentioned in the previous study [21], assuming and identifying EO data proxies for LA in North Korea is one of the most challenging problems, as private land tenure is not recognized in North Korea, and thus there are no land use regulations arising through the restriction of private rights. With regard to restrictions of land access rights for public purpose only, five proxies were included in the analysis. On average, these proxies received the highest agreement among identified land tenure claims, ranging from 34.5% to 70% among scientific, bureaucratic, and stakeholder knowledge groups. As Table 5 shows, there is a significant difference between the bureaucratic and scientific/stakeholder groups in the proxy with fewer green colors and rough textures in public utility networks/nature reserves/heritage sites. Thus, the null hypothesis that there is no difference between scientific, bureaucratic, and stakeholder distributions for this proxy cannot be rejected.
What is interesting about the data here is that the bureaucratic knowledge group obtained the highest level of agreement on proxy identification (63.3%, 60.0%, 70.0%, 66.7%, and 50.0%, respectively in order).

Measurement of Information Quality
The participants were asked to consider data, information, or knowledge with regard to whether an elaborated meaning or an interpreted information element is valid or not and then to complete an eight-question survey about information quality. Of the eight aspects, there was no statistically significant differences between group mean values (1-5) for believability, completeness, consistent representation, interpretability, objectivity, and timeliness as determined by one-way ANOVA. This indicates a high level of consensus on information quality among the different knowledge groups. On the other hand, there was a significant difference from those of variables between the means of three groups both on relevancy at the p value ≤ 0.05 level for the three conditions (between/within/total) and understandability of information (see Table 6). However, the one-way ANOVA test does not tell us where the difference exists, and which specific groups differed. The post hoc Tukey test indicated that the relevancy in scientific and bureaucratic groups (A-B) and bureaucratic and stakeholder (B-C) groups differed significantly at p ≤ 0.05; regarding understandability, there was a statistically significant difference (p = 0.018) between the scientific and bureaucratic (A-B) groups (see Table 7). Table 6. Differences in information quality between knowledge groups.

Summary and Discussion
The principal limitation of closed questions in a questionnaire restricted respondents' answers and expressiveness, enabling us to clarify and confirm the proxies we preidentified (in Figure 1). A small sample was chosen because of the expected difficulty in obtaining a high-level of expertise and an accurate understanding in the given context, based on the fact that significant uncertainty exists regarding geospatial and socio-economic data access, data integration, and data reliability.
Up to now, there have been no controlled studies which compare differences in findings. However, the experimental work presented here provides first investigation how pixel-based land tenure information become legitimate land tenure knowledge to some extent. Although differences of agreement still exist, the most obvious findings to emerge from the analysis is that there appears to be some agreement in judgements of proxies among scientific, bureaucratic, and stakeholder groups. They agree on some proxies but not on others. Weak associations of EO data and land ownership were identified for eight prox-ies, including such as coarse image texture of (dry)paddy fields, high density/compactness of settlements, a linear roof of rural dwellings, seasonal changes of agricultural activities, and small dot-shaped patch or orchards.. On the other hand, some proxies are associated with both collective and state land ownership and are strongly supported by bureaucratic and stakeholder groups that stress the political and strategic use of proxies and possess the most localized and contextualized knowledge.
With regard to proxies for land use rights identification, we can confirm that six proxies relating to individual use rights such as presence of different types of houses/allotments, detached small-size allotment buildings and low built-up allotment land and seven to group land use rights (e.g., amalgamation of community amenities and increase in the number of houses in a certain vicinity) were identified out of a total of 35 proxies. These were found to be particularly associated with houses, allotments, land conversions and improvements to the location. However, no significant agreement was found for all groups, especially in apartment-types of proxies (e.g., rectangular forms, regular alinements, imperviousness, or shadow silhouettes of apartments). These proxies could have been generated by misclassification bias or an erroneous assumption when identifying geospatially informed proxies. The reason for this is not clear but it may have something to do with the nature of apartments where multiple objects reside.
In addition, the findings indicate that elements of EO data interpretation such as color, shape, size, height, and site (e.g., large parcel size of SJP, SJP with green colors, lower elevation, and gentle slope of TPL) may not be associated with land transfer rights. However, there is a knowledge gap resulting from a lack of clear understanding of specific aspects and details of small plots (sotoji) in North Korea by South Korean scientists and government professionals. The stakeholder group that has the most accurate understanding of land tenure relations, land governance and land use practice showed significantly higher ratio of agreement. Regarding the identification of land access rights, whilst there was strong agreement in the validation of proxies between all knowledge groups and considered to be most negotiated knowledge that is scientifically most valid, policy-relevant, and socially robust among others in this study. As far as infrastructure elements are concerned, three groups have shown a higher level of agreement among other proxy selections. We may assume that a proxy identification in relation to infrastructure in North Korea could be more important than anything else. In other words, it is possible that these identified proxies could account for unknown aspects of land access rights in North Korea.
However, these results also need to be interpreted with caution. Firstly, we revealed a strong and consistent association between land ownership and EO data and the mean average of agreement in stakeholder groups-for those with the most localized knowledge of land tenure-were higher compared to those of other groups. If the debate is to be moved forward, a better understanding of different perceptions on land tenure among North Korean refugees needs to be developed. It can be relatively easier for North Korean refugees to distinguish collective and state (farm)land through EO data because they have empirically familiar with the socialist land tenure system. Another reason to support this claim may be that there was an obvious difficulty with defining the term which have accustomed with South Korean land management practices.
Secondly, the present results were significant in at least two major respects. The experimental data suggested that the three groups considered in this study were all in a higher degree of agreement on identifying land use rights, nothing in particular really stood out, but the agreement was distributed evenly at a relatively higher level than the average of those observed in other claims. However, some of those experts still argued that idea was not feasible to empirically derive changes in land use rights in North Korea with EO data. Despite the fact that there has been increased numbers of North Korean refugees (approximately 30,000 residents), one argued that it still remained challenging to understand the notion of "individual" land use rights, according to his/her own experiences of having worked and lived in North Korea.
Thirdly, all knowledge groups showed that the proxies for land transfer rights identification were appeared to the lowest in the level of agreement. However, the questions came up against the great problem of reliability of reported data and we are often not in a position to know whether it enabled participants to provide fairer, more objective, and more accurate and reliable assessments for validating the suitability of geospatially informed proxies. For instance, making a judgement on this, however, inevitably makes additional demands for expertise of either land management or remote sensing. Furthermore, these validations require in-depth local knowledge of the distinguishing feature in North Korea (i.e., sotoji) that is necessary to make association between EO data and land transfer rights. To further identify the proxies, it is necessary to rely on multiple techniques and methods, which combine both direct responses of individual, ranked, and stated choice responses of either individuals or groups and indirect collections of perceptions, beliefs, and social values. In addition to interviews and focus group discussions (FGD), one could add a number of other relevant tools and techniques, such as, Q methodology (which combines quantitative and qualitative data collections techniques with statistical and interpretative data analyses methods), the Delphi technique (which relies on consecutive perceptions and interpretations), and multi-criteria decision analysis (MCDA) (which combines and infers from multiple opinions and preferences).
In order to ensure whether proposed proxies for land access rights were valid, the participants were asked to select between the two, either agree or disagree. Of the 77 participants who completed the questionnaire, nearly half reported that land access rights could be identified in line with the proxies and the EO data. One could argue that this finding is largely biased by the selection of respondents who had similar epistemic backgrounds. The consequence of this bias could be that the understanding of what land rights constitute and what not, would reflect the acquaintance with private rights tenure regimes only and perhaps more limited awareness and experience with State-based tenure regimes. Hence, it is important to keep this possible bias in these responses in mind, especially from scientific and bureaucratic groups.
Nevertheless, regarding the measurement of information quality (see Figure 2), the result was that we expected. Although EO data proxy identification for land tenure relations in North Korea seems to be strongly relevant to respondents' research, policies, and social interests (i.e., relevancy of information; timeless of information), many participants did suffer from a lack of consistent representation and interpretability of information. In order to further investigate and confirm this finding, a provision of multi-disciplinary training will enhance both researchers' capacity to understand land tenure and land governance in question and policymakers' confidence in making spatial decisions in the context of Korean (re-)unification based on GIA. This also enables multiple engaged stakeholders to reveal the interconnection of geospatial science in land management practice. All these require affinity with multiple technical disciplines such as geoinformation and earth observation sciences, civil and environmental engineering as well as sensitivity of social and political processes including public administration, law, economics, and (human) geography. Secondly, one major drawback when implementing GIA was that non-remote sensing scientists, government professionals and North Korean refugees were suffered from scientific and technological literacy [75,76] (i.e., a high density of technical terms of remote sensing used in research). As this case very clearly demonstrated, it is important that reformulating the scientific language in a communicative style should be considered to facilitate active stakeholders' engagement in advancing GIA. Lastly, the existing and grounded knowledge of GIA needs to be translated into voluntary guidelines, policy briefs for scientists, policymakers, and other interest groups. In addition, fact-finding projects from around the globe where unknown land tenure and unsupported land governance exists needs to be implemented.
Different forms of evidence can be used to inform spatial decision making in land management, with data being gathered via statistical and administrative evidence (from government), analytical evidence (by scientific experts), evidence from citizens and stake-holders and evidence from evaluations [77]. This also accords with our approaches, which showed that how a knowledge co-production process helps to validate suitability of geospatially informed proxies and become legitimate land tenure knowledge. Given the fact that this study has to be conducted with the best of all qualities, it should confirm or reject our hypotheses as analytical evidence that may report a possible association between EO data and land tenure with a case study in North Korea. Then, by incorporating and reflecting on local spatial knowledge from multiple stakeholders (i.e., scientists, government professionals and North Korean refugees), it enabled us to tell policymakers what land tenure knowledge they consider legitimate (i.e., scientific validity, policy relevance and social robustness) and what counts as geospatially informed evidence. It is only after knowledge (evidence) co-production processes, the finding of this study supports the view that we can bridge and close the gap between technical aspects of the EO data evidence generation and operational contexts in spatial decision making in land administration and management. Much of the available literature so far on remote sensing for land administration is too product-oriented for skilled and trained technicians [12][13][14][15][16][17][18][19] and insufficiently process-oriented for policymakers and end-users, allowing them to make decisions in the most rational and informed way possible with EO data [22]. The point is not to go against the promising ideas on RS applications, techniques, products, and methods, but to really emphasize that it is an opportune time to undertake the most engaged and negotiated knowledge for both evidence generation and provision of salient and legitimate evidence in responsible and smart decision-making in land administration. This approach can be a way forward remote sensing for land administration 2.0.

Conclusions
The aim of the research question in this study was to determine the extent to which scientific, bureaucratic, and stakeholder knowledge coincides with a set of identified proxies that would enable us to conclude whether certain proposed proxies are scientifically valid, administratively relevant, contextualized, and localized. The findings from this study could then be used to standardize the identification and categorization of certain objects, environments, and semantics visible in EO data that can (re-)interpret land tenure relations in North Korea in preparation for Korean (re-)unification.
Of the four different land tenure claims, both Chi-square and one-way ANOVA analysis revealed that the distribution of agreements relating to land ownership and land transfer rights identification varied among scientific experts, bureaucrats, and stakeholders. Moreover, it was possible to measure intrinsic, contextual, representational, and accessibility attributes of comprehensive information to ascertain associations between EO data and land tenure relations in North Korea based on different viewpoints. From here, the step towards enhancing and developing the existing account is clearly supported by the current findings on information quality.
The findings of this investigation complement those of a previous study relating to a conceptual and methodological development of a geospatially informed analysis in the land administration domain [21]. These findings contribute in several ways to our understanding of how the pixel can be converted to legitimate land tenure knowledge. First, it can help us establish a tailored validation protocol with a higher accuracy and feasibility based on the identification and interpretation of unknown land tenure relations derived from EO data and various types of contextual information as well as a more nuanced view of sociolegal-spatial properties. Second, these findings, being based on knowledge co-production, are relevant to scientists, policy-makers, and practitioners involved in the decision making process relating to land tenure reform and land governance rearrangement on the basis of emerging geospatial technologies and datasets in the context of Korean (re-)unification. Furthermore, the methods used in this study can also be applied to other cases elsewhere in the world, in particular, difficult-to-access regions or fragile and conflict-affected areas. Lastly, the present study contributes additional evidence of geospatially better-informed analysis that emphasizes scientific validity, policy relevance, and social robustness within a responsible and smart land management framework. The geospatially better-informed analysis is not about how geospatial intelligence can directly detect information but how technology can smartly and responsibly support better information regarding land issues for the benefit of scientists, policymakers, and stakeholders.
Although the current study is based on a small sample of participants and used a focus group questionnaire, it offers valuable insights into new validation techniques of suitability for EO data in the land administration domain based on conventional practices that have been formed and customized to accommodate the specific artefacts and validation objectives used in the remote sensing community. The scope of this study was limited in terms of participants' knowledge, for example their level of expertise (scientific), administrative involvement (bureaucratic), and knowledge of locales in geographic areas of interest (stakeholder). However, the limited number of samples adds further caution regarding the generalizability of these findings. Thus, further investigation and experimentation to develop the internal and external validity of findings and GIA methodology would be of great help in understanding the associations between EO data and land tenure claims. Considerably more work will need to be done to identify intrinsic links between geospatial data and land tenure relations. It will then be necessary to concentrate on the development of EO data interpretation in line with artificial intelligence (AI) so as to be able to delve deeper into the future of land administration.  Informed Consent Statement: All prospective participants had been informed all the information (i.e., voluntary nature of study, risks, benefits, procedures to maintain confidentiality, etc.) and agreed to participate before proceeding the E-survey.