Next Article in Journal
A Citizen Science Approach for Collecting Toponyms
Next Article in Special Issue
Place and City: Toward Urban Intelligence
Previous Article in Journal
HidroMap: A New Tool for Irrigation Monitoring and Management Using Free Satellite Imagery
Previous Article in Special Issue
Deep Belief Networks Based Toponym Recognition for Chinese Text
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Graph Database Model for Knowledge Extracted from Place Descriptions

Department of Infrastructure Engineering, University of Melbourne, Parkville, VIC 3010, Australia
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2018, 7(6), 221; https://doi.org/10.3390/ijgi7060221
Submission received: 15 April 2018 / Revised: 3 June 2018 / Accepted: 13 June 2018 / Published: 15 June 2018
(This article belongs to the Special Issue Place-Based Research in GIScience and Geoinformatics)

Abstract

:
Everyday place descriptions provide a rich source of knowledge about places and their relative locations. This research proposes a place graph model for modelling this spatial, non-spatial, and contextual knowledge from place descriptions. The model extends a prior place graph, and overcomes a number of limitations. The model is implemented using a graph database, and a management system has also been developed that allows operations including querying, mapping, and visualizing the stored knowledge in an extended place graph. Then three experimental tasks, namely georeferencing, reasoning, and querying, are selected to demonstrate the superiority of the extended model.

1. Introduction

Place descriptions occur in everyday verbal communication as a way of encoding and transmitting spatial and semantic knowledge about places between individuals [1]. They also occur in written forms such as news articles, social media texts, trip guides, and tourism articles, and thus, the web provides a plethora of place descriptions as well. Information conveyed by a place description, such as place references, spatial relations, and human activities, represents the locative and semantic knowledge of the descriptor about the referred places. It has been postulated that such information might be leveraged to create a place-based information system for modelling and utilizing human knowledge about place [2]. In order to set up such systems, three relevant research problems can be identified: information extraction through natural language (NL) processing, information modelling, and knowledge creation and utilisation. For the first task, techniques such as gazetteered place name identification (e.g., [3,4,5]) and spatial relationship extraction (e.g., [6,7,8]) from texts have already been developed. For the second task, various models have been proposed, such as the GeneralizedUpperModel(GUM) [9], SpatialML[10], and a prior graph-based model [1] that will be significantly extended in this research. Accordingly, this research focuses on the second and third tasks, i.e., modelling and utilizing knowledge extracted from place descriptions.
Place descriptions have been regarded as a qualitative reference system for describing geographic locations, and consist essentially of references to places and their qualitative spatial relationships. Information extracted from place descriptions has been used to construct place graphs [1], which consist of places as nodes and spatial relationships as edges. The edges are directed from locatum (L) to relatum (R). For example, the description ‘The courtyard is on the campus, beside the clocktower’ describes the location of the courtyard in relation to two other places, the campus (as a container) and the clocktower (as a neighbour). This information can be modelled in the form of triplets of a locatum L, the reference to a place that is to be located, a relatum R, the reference to a place that is already located, and a spatial relationship r between the two: <L: courtyard, r: on, R: campus> and <L: courtyard, r: beside, R: clocktower>. The two triplets are used to construct the simple place graph shown in Figure 1.
Though these place graphs are place knowledge bases [11,12], the triplets that build them are stripped off of much of their conversational contexts. It is, therefore, possible to find incompatible information in a place graph, especially if the graph is constructed from combining place descriptions with different contexts. For example, it is perfectly possible to collect seemingly contradicting triplets such as <Melbourne, near, Sydney> and <Melbourne, far, Sydney> from two place descriptions, since the use of the different distance prepositions is context-dependent. Storing such triplets in a place graph without further capturing their original contexts could result in loss of information and misinterpretation. Moreover, the interpretation of spatial relations found in NL place descriptions often relies on information that is not explicit and has to be inferred. For example, people infer whether the reference frame of relative direction relations, as in <courtyard, left of, clocktower> is intrinsic or relative. While people are often capable of such inferences, the basic place graph only stores explicit information. The above place graph also do not consider place semantics or place-related human activities, which are valuable for further spatial reasoning and query answering.
Consequently, the usefulness of the original place graph model from a knowledge base perspective is restricted. Currently, query answers are provided by matching the values of certain property keys and by graph traversing, without filtering for context, inferences, or semantics, while the interpretation of spatial relationships such as relative directions or qualitative distances is limited.
This work reorganizes, revises and extends the original place graph model. The goal is to capture information from place descriptions that is useful, but lost during the triplet extraction. The hypothesis of this research is that, the extended place graph overcomes significant limitations of the original model in georeferencing, reasoning, and querying tasks. Specifically, the extended model can be used to derive more constrained locations of places for georeferencing. It captures additional information such as reference frames to be used in maintaining relational consistency, and it is capable of answering additional spatial queries. Accordingly, the contributions of this research include:
  • The identification of eight types of information that are either embedded in place descriptions or in external context and have not been captured by the original place graph model.
  • An extended place graph model that represents such information and enables future tracing as well as querying.
  • The implementation of the extended place graph into a graph database management system, which allows operations including querying, visualizing, and mapping.
  • The demonstration of how the extended model overcomes limitations of the original place graph in georeferencing, reasoning, and querying tasks based on three experiments.
The remainder of this paper is structured as follows: In Section 2 related work on place, place models, place descriptions, and place graphs is provided. Section 3 identifies the information not captured in the original place graph model, and introduces the extended model. Section 4 looks at the extended model’s implementation and three experiments demonstrating its superiority. In Section 5 the experimental results are discussed, and the highlights of this work are presented in the concluding Section 6.

2. Related Work

Place based research is an emerging field in GIScience with importance widely acknowledged (e.g., [13,14,15]). The purpose is to smooth and simplify human-computer interaction by capturing, modelling, and utilizing place-related information. For example, Egenhofer and Mark suggested Naive Geography in order to capture and reflect the way that non-experts think and reason about space and time [16]. In this section, related work about how places are conceptualized, modelled in information systems, and communicated in descriptions is discussed.

2.1. Place as a Cognitive Concept

Space and place are two fundamental concepts in geography, and more broadly in social sciences, humanities, and information science [17]. Although the concept of place has existed for long in philosophy and psychology [18], it is relatively new in the GIScience domain. People talk about space by referring to places [19], and the definition of place has been discussed extensively (e.g., [20,21]). Compared to a space-based perspective of geography, place is regarded as space infused with human meaning and experience, and thus, enables conversations [22]. Place is also regarded by some as the prototypical spatial reference in human, economic, and culture geography [23].
Place, as a cognitive concept, is inherently vague, and this vagueness is evident in human cognition, perception, as well as natural language descriptions [24]. Some researchers believe the concept of place may be too vague to be formalized, except in narrow circumstances [14]. It has been argued that places do not have any natural boundaries, and are locations that have been given shape and form by people [21,25]. Agnew [26] suggests thinking of place in relation to other places, instead of bounded and isolated features. In contrast, geographic information systems (GIS) and services are developed on unambiguous, crisp, and metric geometries removed from human concepts [27], and has therefore limited ability in modelling and utilizing place information [28]. In short, place, while fundamental to human cognition and communication, is still well beyond the reach of current information systems.

2.2. Place Models from an Information System Perspective

Web-based place services such as Google typically use gazetteers to store the locations of places by point. A gazetteer, which is a dictionary of geographic names, contains three core components: place names, feature types, and footprints [29,30]. A gazetteer stores usually the official or authoritative place names, and sometimes also stores alternative names such as vernaculars. A place type is a category from a feature-type thesaurus for classifying places according to their semantics, and is often biased towards political or commercial entities and geographic features with large extents. A footprint represents the location of a place, typically by a single coordinate pair as an estimated centre of a place, which is not capturing its extension and often inappropriately precise [21]. Gazetteers are widely utilized for both enterprise and academic purposes such as geographic information retrieval (GIR) [31,32,33], navigation services, and web-mapping applications.
Capturing the uncertainties associated with places is one of the challenging tasks of place-based research. Vague place references such as ‘The Midwest’, or the ‘The Alps’ do not have natural boundaries for defining their spatial extents. Some models have been suggested to cope with such uncertainty, such the ’egg-yolk’ model [34], fuzzy sets [35], rough sets [36], broad boundary regions [37], and supervaluation [38]. However, none of these approaches have been widely adopted, and they are argued to be inadequate due to the fact that they focus on the boundary rather than on the central notion of the place [21]. The same place reference can refer to different spatial extents in different contexts. For example, ‘Federation Square’ can refer to either a set of connected places at the banks of the Yarra River, or the limited open area at the centre of these places. Other studies focus on capturing the uncertainty of the locations of places instead of their boundaries. Methods were developed to combine locality uncertainty measurements from different sources, such as imprecise distance and direction [39,40]. Uncertainties of qualitative spatial relationships in natural language locative expressions of places has also been studied [41].
Probabilistic and membership-based approaches, some with the help of web-harvesting techniques, have been used to visually represent the footprint of vague places. Such representations visualize the degrees to which any location belongs to these places. Montello and Goodchild conducted a study to determine the footprint of down town Santa Barbara by asking participants to draw the boundary of it and aggregating the results [42]. Later, data-driven methods were proposed using techniques such as Kernel Density Estimation and clustering, based on geotagged social media content of place names or tags [43,44,45,46]. Other studies focus on deriving continuous surfaces to represent places [47,48]. Such field-based representations computationally characterize the inherent uncertainty of the extent of vague places, and enable approximate crisp boundaries to be derived for place-based applications.
Winter and Freksa suggest to capture the cognitive and linguistic nature of a place in contrast to other places that are relevant to the discourse [21]. Places function as spatial anchors, and are determined by their relationships to other places in the environment. This idea refers to a sense of distinctiveness [49] and wholeness. Such contrast sets of places can either be explicitly mentioned in the discourse, implied, or pre-exist as shared knowledge. Using contrast sets, Vasardani et al. provided a conceptual model to interpret the region implied by preposition at; the contrast sets make the results context-sensitive [50].
Non-spatial place information such as place semantics, equipment, characteristics, and affordance can also be useful in applications such as place searching and querying, and some of them have already been studied. For example, it has been argued that place affordance is a core component for defining place and designing ontologies, and several works have attempted to formalize it [51,52,53]. Semantical categories used in gazetteers can be regarded as taxonomies according to place affordance, although sometimes lack flexibility and interoperability [54].

2.3. Modelling Place Descriptions

The structure of place descriptions has been studied in linguistics and spatial cognition research (e.g., [55,56]), and recently in the domain of GIScience as well. For example, Richter et al. found that place descriptions typically apply hierarchical structures from different granularity levels [57]. Place descriptions provide a qualitative reference system for describing geographic locations using references to places (often as place names) and spatial relationships. Computers have difficulties recognizing and interpreting such structures, often due to vernacular place references and flexible relationship expressions [19].
Several models for processing spatial languages have been proposed. The GUM ontology [9] is a comprehensive linguistic model for decoding the semantics of linguistic spatial expressions. SpatialML [10] has the goal of annotating places referred in text as well as spatial relations. Similarly, spatial role labelling [6] focus on extracting spatial objects (as a superset of places) and spatial relationships from text. The annotated places and spatial relations (which can be represented by triplets) by these models can perfectly be mapped to the original place graph model without much difficulty. However, such mapping still brings the same issues as identified in the introduction section, e.g., the loss of context for qualitative distance and relative direction spatial relationships. Moreover, certain information considered by some annotation models, e.g., motion and path in the ISO-Space annotation language [58] as well as the GUM ontology, is not directly supported by the original place graph model either. This research also considers modelling information that is not considered in these previous models, such as non-binary spatial relationships and place affordance.
Vasardani et al. study place references and spatial relationships embedded in locative expressions, which can be extracted by a parser in the form of triplets [1]. A place reference can be a place name (also called a toponym, e.g., ‘Paris’) or a count noun (typically a place category, e.g., ‘the library’) [20]. It can also be in other vernacular forms (e.g., ‘the meeting place’). References that are not official place names are more challenging to locate and typically require considering conversational contexts. A triplet provides location information of a place by giving a relationship to another places as an anchor. The spatial relationship expressed in a locative expression is often a preposition (e.g., on, in or at), but can also be a verb (e.g., surrounding), a phrase, or even be implicit (e.g., Sydney, NSW).
The locatum and relatum in a triplet have also been called as a locatum and a reference object [6] in the spatial semantic parsers’ literature [59,60]. In other disciplines such as cognitive linguistics and psychology, a spatial entity whose location is of relevance has also been called a trajector [61,62], a figure (a Gestalt-psychological term meaning a perceptional identifiable object in comparison to the ground) [63,64], or a general term referent. It has also been called a target as the subject of a spatial relation, and the corresponding object of the relation is called a landmark [65]. The term landmark has also been used to describe the reference entity in relation to which the location of a trajector is specified [61]. Generally, a relatum, or a landmark, is an entity whose location is better known thus in relation to which the location of the locatum can be specified. A locatum defined by these terms are generally similar: it may be static or dynamic; a person or an object whose location is of relevance. In the literature there are also minor differences in the definition of these two terms. For example, an event can become a locatum as specified in [61]. Differences also exist for the definition of a relatum. For example, a deictic centre as a point of reference is regarded as a landmark in [61]. In this research, we consider only static, identifiable objects.
Place graphs from triplets can have multiple edges between each pair of nodes, representing different spatial relationships between the two places, because the same places may be referred to multiple times, or in several place descriptions. For each type of relationship, at most one instance is stored between any pair of nodes as an edge, and additional instances will be regarded as duplicates and discarded.
The references to the same place may also be made in different terms. Thus, nodes referring to identical places should be identified and merged. For this purpose, a graph-merging approach considering string and semantic similarity as well as similarity of spatial relationships to other places has been developed [11]. An example of a merged place graph is shown in Figure 2, constructed from multiple descriptions of the same environment. Place graphs have been leveraged to locate non-gazetteered references to places [66], create sketch maps [67], and identify landmarks [12].

3. Extending the Place Graph Model

This section first analyses types of information that are not captured in the original place graph model, as well as the tasks for which they matter. Then, an extended place graph model that caters for this information is introduced.

3.1. Information not Captured in the Original Place Graph Model

The types of information identified below are not considered in the original model. Most of them provide contextual knowledge, which could affect the interpretation of other information communicated in place descriptions (e.g., spatial relations), and, thus, should be captured. The definition of context is task-specific; in this research we adopt the categorization proposed by Wolter and Yousaf [68] of description-, environment-, and human-dependent contexts. For instance, near can refer to different distances according to other places relevant to the discourse [21] (description-dependent context). Certain relations require information from the environment in order to be interpretable, e.g., ‘two blocks down the street’ (environment-dependent context). Places and spatial relations can also have different semantics for different individuals (human-dependent context).
Figure 3 shows the UML diagram of the original place graph model.

3.1.1. Place Semantics and Characteristics

Place descriptions sometimes contain non-spatial information about places, such as their types (e.g., ‘the room is a lecture theater’), the activities they afford (e.g., ‘having seminars and lectures’), the things they equip (e.g., ‘the room has a projector’), as well as their characteristics (e.g., ‘old, arge’) [69,70].
Place semantics and affordances have been used for characterizing places and enabling place-based search as well as analysis [48,70,71]. Different places may have the same affordances, and one place may have multiple affordances according to individuals or time periods. The way that a gazetteer categorizes places does not always align with the way people regard these places, despite that such categorization is useful in many applications. Capturing semantics and characteristics of places in a place graph could provide additional dimensions for tasks such as georeferencing, identical-place matching, and querying.
In place descriptions, these types of information are often expressed in certain patterns, e.g., as adjectives, nouns followed by words such as ‘is’ and ‘has’, or as verb phrases. Such patterns can be recognized using a trained parser, and the feasibility of creating such a parser has been demonstrated in previous research (e.g., [70,72]).

3.1.2. Places and Relationships from Discourse and Their Sequential Order of Appearance

Places referred to in different discourse provide contextual knowledge for interpreting spatial relations and locating places. For instance, near in the description ‘the building is near the Flinders Street Station’ can be interpreted differently in terms of distance, depending on the spatial context (the geographic extent the description is embedded in), e.g., the limited area around the station, or the whole Melbourne CBD. Such a spatial context can be inferred by looking at the places mentioned in the same discourse.
Other than places, spatial relationships from the same discourse provide contextual knowledge as well. For example, relative direction relationships can be used to infer the reference direction used by the descriptor, especially when using local landmarks as relata. The inferred reference directions can help with interpreting other relative direction relationships in the discourse, and thus be used to locate places as locata of these relationships.
The order of appearance of places and relationships in a place description should also be preserved. For example, descriptors often switch the level of spatial granularity monotonically, e.g., changing from city-level to district-level [57]. Such changes in context cannot be detected without recording the order of appearance of place references and spatial relationships. Similarly, reference directions can also change within a description, for example at turns, and affect the interpretation of subsequent relations.
Storing sequential order also helps linking different place references that are referring to the same place. Definite references such as ‘the building’, which refers to a building described previously in the discourse, can be ambiguous without sequential appearance information, if there were multiple buildings mentioned in the discourse.
Information about places and relationships from the same discourse, as well as their sequential order of appearance is not modelled in the original place graph model. Triplets from different descriptions are merged without any indexing mechanisms for future separation. The two types of information can be obtained directly without requiring an additional parser; the challenge is how to modify the place graph model in order to store this knowledge.

3.1.3. Reference Frame and Direction

The original place graph does not capture spatial reference frame and reference direction information [64,73]. Anchoring relative direction relations is, thus, problematic, as it is unknown which directions are being referred to. It is also difficult to perform qualitative spatial reasoning (QSR) [74,75] or to interpret seemingly contradicting direction relations, as in the example <the Arts Faculty Building, left, the Old Quad> and <the Arts Faculty Building, right, the Old Quad>, without knowing the reference directions used in both situations.
Reference frames in natural language have been classified in the literature [73]. In this research, a relative direction reference frame is defined to be either intrinsic or relative. For example, the expression ‘the café is in front of the library’ is likely to use the intrinsic reference frame of the library, which has a front, while ‘you will find the library to the left side of the lawn’ is likely to use the relative reference frame of the walking direction, since the lawn has no front (or left). A parser for identifying reference frames and heading directions is not yet available. Nevertheless, we will demonstrate how these two types of information can be modelled in an extended place graph, as well as how they can be leveraged in application scenarios.

3.1.4. Non-Binary Relationships

Non-binary spatial relationships, e.g., across, between, around, and among, involve more than two places thus cannot be represented by the aforementioned triplet structure. Vasardani et al. suggested for the original place graph that ternary relations can be modelled by two edges linking two relatum nodes to the same locatum node [1]. However, these edges are not indexed hence can become ambiguous when several of these relationships are known for one place. Furthermore, several of these relationships can have more than three places involved. For non-binary relationships, the task in this research is how to properly model them in order to preserve the original semantics and to allow future tracking.

3.1.5. Number of Occurrences of Place References and Spatial Relationships

The original place graph does not store the number of times each place reference is used to refer to a place, and thus, the information of which references are more frequently used for a place is lost. Storing the number of occurrences for place references can distinguish between common (popular) names and less-frequently used ones. It also enables analysing which references are more often used in certain conversational contexts, description themes, or by certain people.
The number of occurrences of each relation being used between two places is not recorded in the original model either, as only one instance for each relation can exist between any two nodes. As a result, if two contradicting relations north of and south of between two places have both been stored in a place graph, it is impossible to determine which one is more likely to be the true, according to frequencies. By preserving the number of occurrences for each relation, the one that occurs more often can be regarded as a better-agreed upon assertion and, thus, more likely to be true.

3.1.6. Conceptualization of Places

According to Lynch’s classification of elements of the city [76], a place from an urban environment can be conceptualized as a node (a strategic spot that is accessible), a path (a channel that affords movement of the observer), a district (an accessible and identifiable area), a landmark (an inaccessible place typically for spatial referencing), or an edge (an inaccessible boundary), as a 0D, 1D, or 2D object. The classification has been adopted in GIScience, such as for describing the functional spatial structure of urban environments using graphs [77].
The sense of place emerges as it is functionally different from its surrounding environment and, thus, becomes distinguishable. The functional difference between places is sometimes revealed by place conceptualization in descriptions, and such difference is context-dependent. The same place can be conceptualized differently in different description contexts or even within the same description [21], depending on what information the descriptor wishes to convey. For example, a district can be regarded as a 2D container for describing places within it, or being regarded as a 0D landmark for locating other nearby places, either from the same granularity level or not.
We argue that capturing the conceptualization of places in descriptions allows for better interpretation of the information communicated. For example, the same description ‘the place is to the north of the campus’ can either be interpreted as an external cardinal direction relationship (mapped as north and disjoint) or an internal one (mapped as north and inside), depending on the conceptualization of the relatum, i.e., if the descriptor is regarding the campus as a landmark and describing places nearby, or as a container and describing places within. In the examples, the conceptualization of a place can be regarded as a variable that affects the mapping of vernacular spatial relationship expressions to formal relations. Without capturing place conceptualizations, the mapping process becomes either risky or unrestrictive.

3.1.7. Route and Accessibility

Some place descriptions can take the perspective of a route description. Route descriptions are often associated with reference directions and accessibility information for navigation purposes. For a triplet in a route description, the accessibility from the relatum to the locatum is usually implied. Accessibility also determines whether the triplet belongs to a part of a route or not. For example, Moncla et al. use motion expressions to distinguish places that are only seen and places that can actually be reached from hiking descriptions, in order to reconstruct itineraries [78]. In the GUM ontology [9], a relationship indicating accessibility is classified as a GeneralizedRoute, and is distinguished from a GeneralizedLocation which does not belong to a route.
Tracking whether places and relationships originated from a route description enables querying of path knowledge for purposes such as navigation support. Moreover, as the number of occurrence of relationships is also preserved in an extended place graph, it is also possible to identify prominent routes that are described more often by people.

3.1.8. Description Context and Source Context

Some relations can vary with context. For example, Yao and Thill identified several contextual variables that determine the choice of qualitative distance relationships, e.g., the current type of activity, and the available mode of transportation [79]. Furthermore, the places referred to in a place description are depending on the purpose of the description. Therefore, information of the theme of a description is useful for place analysis. For example, Kim et al. observed the differences of place occurrences in descriptions of four themes: environment, business, travel, and other [12]. In their implementation, places from their place graphs have to be manually re-link to their original descriptions, as the correspondences between places and occurrences in descriptions are not kept when the graphs were constructed. Identifying thematic topics of textual documents can be done using existing techniques such as topic modelling [47,48].
From a database perspective, the metadata of a place description, e.g., its source and time-stamp, should also be preserved. Such knowledge can be useful when determining the reliability or time validity of the extracted knowledge. Thus, original speakers and recipients form a facet of context. Individuals may describe the same environment differently, in terms of the selection of place and place semantics, spatial relations, reference frame, and conceptualization. The intention of giving a description to a recipient (or recipients) may also influence how a description will be organized [68]. Human-level factors such as age, gender, ethnicity, and degree of familiarity with the environment have also been identified as being influential on the meaning of the spatial relationships communicated [79]. In addition, the affordance of a place also varies among individuals. For instance, a supermarket may afford for some people shopping, and for others (or at other times) work.
Linking descriptions with people allows richer types of queries and analysis to be performed on an extended place graph, e.g., what places are more frequently mentioned by whom. As another example, an extended place graph database supporting an autonomous vehicle can be used to establish links between passengers’ accounts and the place each one of them calls my home. Such human-related information is often unavailable or limited, for example when descriptions are collected from online documents.

3.2. The Extended Place Graph Database Model

The extended place graph database model is illustrated by a UML diagram shown in Figure 4. The model preserves all the additional information specified so far and is designed to support efficient querying through graph traversal. Each class in the diagram represents a node type in an extended place graph, and each relationship represents an edge type. All node types and some edge types are associated with properties. Values of some of the properties will be set to null if the corresponding information is not (or not yet) available. For example, properties such as footprint of a place node cannot be obtained directly from place descriptions, and must be derived using georeferencing techniques [66].
The reason to apply graph database model is that, it opens opportunities for new applications that rely on graph traversal, e.g., qualitative spatial reasoning regarding place graph as a constrained relational network, landmark identification considering node degree and centrality, measuring place relevance by graph distance, or queries by relationship and path. Graph databases have computational advantages in storing and querying (by relationship) richly connected data compared to relational databases. For example, if the UML diagram in Figure 4 were modelled by a relational database, queries such as finding the shorted path of spatial relationship connecting two place nodes would require complex joining operations and may not be done within a reasonable amount of time.
An n-plet is an extension of a triplet, and each place reference that occurs in a place description is regarded as being embedded in an n-plet. An n-plet is often a triplet representing a binary relationship; however, it can also represent a non-binary relationship, e.g., between, around and across, having multiple locata and relata based on the sequential order of appearance in the description. An n-plet can also consist of only one locatum without any relatum, as a place reference may not be embedded in any locative expressions in a description, e.g., ‘Melbourne is a populous city’. Thus, an n-plet must have at least one locatum, and any non-negative number of relata. In the remaining part of the section, each type of node, edge, and the associated properties are discussed.

3.2.1. Place Reference Node

A place_reference node represents a reference to a place from an n-plet in a description, either as a locatum or a relatum. Each place reference node must have one and only referred_by incoming edge from a place node. Between place nodes and place reference nodes are n:1 relationships, i.e., a place may be referred to by one or more different place references, while the same place reference may be used to describe different places in different contexts (but modelled by distinct place reference nodes). For example, two references ‘Flinders Street Railway Station’ (an official place name) and ‘the train station’ (a non-gazetteered reference) come from conversational contexts where they refer to the same place (Flinders Street Railway Station). In a different context, the reference ‘the train station’ may refer to another train station.
A place reference node, when created, is by default linked to a new place node instance through a referred_by edge. A merging algorithm [11] can then modify the correspondence by removing the newly created edge and place node instance, and establishing another referred_by edge between the place reference node and a pre-existing place node, if it is determined so.
Since place references are embedded in n-plets extracted from place descriptions, each place reference node has one and only outgoing edge in to an n-plet N_plet node. An in edge has two mandatory properties: pos and as. The value of as can either be locatum or relatum, representing whether the place reference is corresponding to the locatum or the relatum of the n-plet. The property pos is a positive integer denoting the index of the occurrence of the place reference, as it is possible that an n-plet has multiple locata or relata. For a triplet, the value of pos is 1 for either of the two place reference nodes it links to.
Thus, a place reference node is defined by Axiom (1) below:
Place _ reference referred _ by - . Place in . N _ plet
A place reference node has six properties: place_reference, conceptualization, place_type, equipment, characteristic, and affordance. Among the properties only place_reference is mandatory. The value of conceptualization is one of the categories based on Lynch’s classification: node, path, district, landmark, or edge. Values for the remaining four properties are unrestricted, and some examples are given in Section 3.1.1. The data type of these four properties is string list, as multiple values of each of these properties can be described. These properties are not stored under place nodes in order to preserve the context of where and by whom these values are given.

3.2.2. N-Plet Node

An n-plet node is defined by Axiom (2) below. Other than in as already explained, each n-plet node has one and only edge from as an outgoing edge, denoting the description from which this n-plet is extracted. The edge from has the same property pos as in, showing the sequential order of appearance of an n-plet in the description. An n-plet node can have one or more in and map edges, depending on the number of locata and relata, and the number of mapped formal spatial relations mapped respectively.
N _ plet in - . Place _ reference from . Description map . Spatial _ relation
An n-plet node has two properties: spatial_relation_expression and reference_frame. The first one stores the original spatial relationship expression used for the n-plet in the description. In an original place graph, such expressions are formalized by a controlled vocabulary before graph construction [67], yet it is quite often that the same spatial relationship expression can be mapped to different formal relations depending on the context. Therefore, in an extended graph, the original spatial relationship expressions are kept, and the mapped relationships will be stored separately as spatial_relation nodes linked by outgoing edges map from n-plet nodes.
If the spatial relation expression of an n-plet is mapped to a relative direction relation, the value for the property reference_frame can either be intrinsic, relative, or null (undetermined). The intrinsic value means the relative direction is based on the intrinsic direction of the relatum (e.g., ‘in front of the building’), while the relative value means a non-intrinsic reference direction is adopted. If the value relative is used, the n-plet node will have an additional outgoing edge has_reference_direction to a place node referred in the discourse, anchoring the reference direction used for the n-plet. A has_reference_direction edge has a property as, and the value is one from {front, back, left, right, left front, right front, left back, right back}. An example is given in Figure 5, with description:
“… coming from the Main South Entry, the Baillieu Library will be on the left hand side of the South Lawn …”
The Appendix A of this paper provides a complete example of constructing an extended place graph from a description, with the example sentence above as part of the description.

3.2.3. Place Node

Each place node represents a place. In an extended place graph, a place is identified from one or more place descriptions by place references embedded in n-plets. A place node does not have any place references stored; however, all the references used for referring to it (as well as the number of occurrence for each reference) can be obtained easily from all the place reference nodes it is connected to through outgoing referred_by edges. A place node is defined by Axiom (3):
Place referred _ by . Place _ reference
A place node has three derived properties. The value of footprint represents the location of the place, and can be either a point, a polyline, a polygon, or an approximate location region (ALR) [66] which can also be made crisp and represented by a polygon. An ALR is a region derived using spatial relation search space models (including formal and probabilistic models) for georeferencing places without gazetteered references. The value of property type denotes the data type of the footprint, e.g., polygon. The property spatial_granularity is a classification of the spatial granularity of the place based on the categories found in [80]: {furniture, room, building, street, district, city, country}.

3.2.4. Route Node

Places referred to as part of a route are grouped by linking their corresponding place nodes to a route node through part_of edges. The property pos of a part_of edge records the position of the place reference in the route by sequential order of appearance, and the value is a positive integer.

3.2.5. Spatial Relation Node

Each spatial_relation node represents a formal spatial relation. Unlike a value of the property spatial_relation_expression stored in an n-plet node, which can be expressed in flexible ways, formal relations are from a controlled vocabulary. Binary formal relations from four families are considered, as listed in Table 1. The vocabulary of non-binary relations in not restricted, since non-binary relations have not yet been well defined in literature. Accordingly, the value of property family is from one of the five spatial relation families: {qualitative distance, cardinal direction, relative direction, topological, non-binary}. The property relation stores the name of the relation.
Mapping between spatial relation expressions and formal relations is a m:n relationship. A spatial_relation_expression value can be mapped to one or more formal relations from single or multiple families, and different spatial relation expressions could be mapped to the same formal relation. The mapping process is context-dependent. For example, a spatial relation expression ‘north’ can be mapped to either north, disjoint (external north) or north, inside (internal north), depending on the original expression and place conceptualization. Compared to an original place graph, the extended model supports more flexible and context-aware reasoning of spatial relations.

3.2.6. Description Node

A description node represents a description document as a single discourse. It is used to store the global-level context variables of the descriptions from which n-plets were extracted. The four properties of a description node, i.e., theme, transportation_mode, source, and timestamp, have already been explained in Section 3.1.8.
A description node has at least one instance of an incoming edge from from an n-plet node. The property pos of from is the position of the linked n-plet in the description by appearance, and the value is a positive integer. The property spatial_context is a derived one, representing the geographic extent a description is embedded in, using the approach developed in [66]. For example, if the extracted places are landmarks in the Melbourne CBD as a suburb, the context of the original description is likely to be about Melbourne CBD. Finally, a description node can also have outgoing edges created_by and given_to to a user node, if such information is available.

3.2.7. User Node

A user node either represents a descriptor (connected by a created_by edge from a description node) or a recipient (connected by a given_to edge). The same user node can be connected to multiple description edges by either roles. The property info of a person node is not restricted in this research, as what information is useful for the application of an extended place graph is domain- and task-dependent. Examples have been given in Section 3.1.8. The property value is defined in the format of JSON as key-value pairs.

3.3. Summary

The purpose of the extended place graph model is to capture information ignored in the original place graph, in order to overcome the graph’s limitations. The extended place graph captures the five core concepts of spatial information proposed by Kuhn [81]: location, field, object, network, and event. The graph stores the location of a place, and the probabilistic ALR derived using the approach in [66] represents the approximate location of the place as a field, based on the relational knowledge stored in a graph. Different places are modelled as node objects in a place graph, characterized by information including place references and semantics. An extended place graph also forms a network by representing the links not only between places, but also between places and descriptions, as well as between places and people. Such links can be strengthened given additional descriptions, as their times of co-occurrence are captured as well. Finally, an event involves aspects of people, time, location, and activity, and these aspects are covered by node properties info (of user nodes), timestamp (of description nodes), footprint (of place nodes), and affordance (of place reference nodes) respectively.

4. Implementation and Experiments

This section describes the implementation of the extended place graph model, as well as three experiments to demonstrate how an extended place graph outperforms the corresponding original one. We do not use all the identified information in Section 3.1 in the experiments, since the goal is not to provide a comprehensive evaluation, but rather to demonstrate the superiority of the extended model. We also discuss additional challenges and insights.

4.1. Data Overview and Construction of the Test Place Graph Database

The description dataset used in this research are place descriptions submitted by 42 graduate students about the University of Melbourne campus environment. The data set consists of over 9000 words, 726 n-plets, and a total of 241 distinct places being mentioned. A place can be referred to by multiple place references in different descriptions, or even within the same description. Some places are not referred to by any gazetteered names due to two reasons: the place itself is not captured in the gazetteer, possibly due to granularity (e.g., ‘the small courtyard’, ‘the dean’s office’); or the place is only referred to by synonyms or other vernacular names as commonly-used references (e.g., ‘the mathematics department building’). A part of a description is shown below:
“… If you go into the Old Quad, you will reach a square courtyard and at the back of the courtyard. You can either turn left to go to the Arts Faculty Building, or turn right into the John Medley Building and Wilson Hall. Raymond Priestly Building is the open aired ground area which is in front of Wilson Hall that is adjacent to it. Towards North, which is when you turn left when exiting the Old Quad, you will see Union House where there are shops selling foods. If you continue walk along the road on the right side where you’re facing Union House, you can see the Beaurepaire and Swimming Pool. There will also be a sport tracks and university oval behind it …”
A parser was used in previous studies to extract triplets from the dataset [8]. The extracted triplets are stored in a csv file with three columns: locatum place reference, spatial relation, and relatum place reference. However, such a csv file does not preserve the necessary information for constructing an extended place graph. Therefore, we create a JSON file in order to capture additional information. For the following experiments, the information of reference direction is required as well. Due to lack of a parser capable of extracting this information, a graduate student was asked for manual annotations to obtain the information. To minimize the influence of pre-existing local knowledge on the annotation process, all place references were replaced by five place types: building, spot (any place finer than a building), area, alley, and street. For example, the first building name that occurs in a description is anonymized as Building_1. The task of the student was to assign each relative direction relationship with three property values: reference frame (intrinsic or relative), the anonymized reference of the place indicating the reference direction (e.g., ‘Main South Entry’ in Figure 5), and the reference direction (value of property as for edge has_reference_direction). The structure of the final input JSON file for creating the test place graph database is shown below. Again, for a complete graph construction, please refer to the Appendix A.
          {"descriptions": [
              …
              {"did": 1,
              "nplets":
                 [{"nid": 2,
                 "locatum␣reference": "Baillieu Library",
                 "spatial␣relation␣expression": "on the left hand side",
                 "relatum␣reference": "South Lawn",
                 "reference␣frame": "relative",
                 "reference␣direction": ["Main South Entry", "back"],
                 "relation␣map": ["left"],
                 …},
                 {…}, …]
              },
              {…}, …
          ]}
        
A place graph database management system interface has been implemented using Neo4j (https://neo4j.com/product/) and Python, as shown in Figure 6. The system is able to perform tasks including description parsing, graph creation from JSON, graph visualization, georeferencing, qualitative reasoning, querying, and mapping. Some functions are necessary for the following experiments and will be explained in detail in the remaining part of this section.

4.2. Experiment I: Locating Places Without Gazetteered References

An approach has been developed previously to georeference places in an original place graph [66]. The approach first identifies and disambiguates places with gazetteered, i.e., well known references, called anchor places, based on gazetteer look-up and a clustering-based disambiguation approach. Then, the anchor places are used to derive the spatial context of the graph. Next, an ALR is derived for each remaining place node by intersecting the spatial context, as well as the search spaces for the spatial relationships between this place and the anchor places. Search spaces can either be based on formal or probabilistic models. An illustration is provided in Figure 7 based on formal search space models. The location of place b: Federation Square is represented by the shaded region, which is derived by intersecting the search spaces of the three relationships: east of a, south of c, and near c.
In this experiment, three ALR refinements are possible, leveraging two of the newly captured information: identifying places from the same discourse, and reference direction.
The first refinement is by separating spatial contexts for individual descriptions. An illustration is shown in Figure 8 with two descriptions, each contains four anchor places. The spatial contexts of the two descriptions are different, as shown in the right-side figure (e.g., description one is describing a street block in a city, and description two is about the whole city region). Dash lines in Figure 8 indicate spatial contexts, which are derived by the clustering algorithm based on the disambiguated locations of the anchor places. Using the original place graph model, there is no way to identify which places are originally from the same descriptions, and thus, only one spatial context can be derived, as shown in the left-side figure. Consequently, the ALRs generated for places in the first description will be inappropriately large if not further constrained by spatial relationships. In comparison, an extended place graph allows tracking places from the same descriptions, since the links are now preserved. Therefore, in this refinement, separated spatial contexts regarding each individual description will be derived, as shown in the right-side figure. The refined spatial contexts will then be used for deriving ALRs, which are expected to be more constraining and thus more useful for georeferencing.
The second refinement is by anchoring relative direction relations using the newly captured reference direction information. Search spaces for relative direction relations can only be defined as buffered regions similar to near information, as shown in Figure 9 (left). The reference direction of a relative direction relation can only be anchored, if the locations of the relatum and the place indicating reference direction (i.e., the place linked by the has_reference_direction edge) are available. The proposed refined search is illustrated in Figure 9 (right), with front indicating the reference direction, and the shaded regions representing search spaces.
Third, the interpretation of qualitative distance relationships, such as near, can be contextualized by considering other relevant places mentioned in the discourse, based on the theory of contrast sets [21]. An example is given in Section 3.1.2. In this experiment, the contrast set of a relatum is defined as the set of places that are of the same granularity level as the relatum, and have been used as relata in the same description. The underlying assumption is that when a descriptor says ‘A is near B’, it is often implied that A is closer to B than any of the other places used as relata in the discourse, and are of the same granularity as B. Similarly, Vasardani et al. provided a first interpretation of the preposition at by contrast sets using their Voronoi diagrams [50]. In this research a similar approach is adopted, as illustrated in Figure 10. The shaded region indicates the search space derived based on a contrast set, and which is intersected with other search spaces to derive the ALR of the locatum in the georeferencing process.

4.3. Experiment II: Relational Consistency Reasoning Using Reference Direction Information

This experiment aims at leveraging reference direction information to determine the relational consistency of relative and cardinal direction relations stored in an extended place graph, e.g., determining whether the two relationships <the Arts Faculty Building, left, the Old Quad> and <the Arts Faculty Building, right, the Old Quad> are contradicting, or not. This experiment is not about checking global, but rather local consistency. Relational composition is also not considered.
Two complementary reasoning approaches are suggested in this work. One is based on search space intersection, and the other relies on reference direction transformation. A illustration of the first approach is given in Figure 11, where the shaded regions are search spaces as explained in the previous experiment. The approach first finds all directional relationships between two places in an extended place graph, and derives their search spaces. If any pair of these search spaces does not intersect, an inconsistency is flagged.
However, deriving such search spaces requires the location information of the relata as well as the places indicating the reference directions, which may not always be available. Therefore, a complementary qualitative reasoning approach is proposed using reference direction translation rules. An illustration is provided in Figure 12. Given the knowledge in (a), consistent relationships can be inferred as shown in (b) and (c), through translations of the reference direction. Thus, given another relationship ‘A is right of B with C as back (meaning with C indicating the back direction of the reference direction)’, it can be identified as inconsistent with (a), (b), or (c). The drawback of this approach is that it is only applicable to scenarios with up to three places, while the previous approach does not have this limitation. The full reasoning procedure of this experiment is described in Algorithm 1.
Algorithm 1 Consistency reasoning of directional relationships.
Input: place _ graph
Output: inconsistent _ pairs
 1:
inconsistent _ pairs : =
 2:
for relationship in getDirectionalRelations( place _ graph ) do
 3:
     search _ space = computeSearchSpace( relationship )
 4:
     locatum , relatum , reference _ direction _ place = getPlaces( relationship )
 5:
 
 6:
     others = getDirectionalRelationsBetween( locatum , relatum )
 7:
    for new _ relationship in others do
 8:
         new _ search _ space = computeSearchSpace( new _ relationship )
 9:
        if intersect( search _ space , new _ search _ space ) == False then
10:
            inconsistent _ pairs ( relationship , new _ relationship )
11:
        end if
12:
    end for
13:
 
14:
     others = getDirectionalRelationsWith( locatum , relatum , reference _ direction _ place )
15:
    for new _ relationship in others do
16:
        if new _ relationship not in transformation( relationship ) then
17:
            inconsistent _ pairs ( relationship , new _ relationship )
18:
        end if
19:
    end for
20:
end for
21:
inconsistent _ pairs .removeDuplications()
22:
return inconsistent _ pairs

4.4. Experiment III: Spatial Knowledge Querying

All the queries to the original place graph can still be performed in the extended place graph, e.g., ‘find the relationship between two places’, or ‘find places that have relationships to a particular place.’ An extended place graph additionally supports queries complex queries, such as finding in which descriptions a particular place occurred, as well as other places mentioned in the same descriptions ranked by their co-occurrence frequency.
Cypher queries are used in an extended place graph. A cypher query is a graph traversal algorithm that attempts to find nodes or edges that match certain label (place and edge type) and properties values. In this research, results are returned by criteria string matching.
Three query examples that cannot be answered with the original place graph are selected. The queries as well as their corresponding cypher queries are shown below.
  • Find the most frequently referred to relatum (landmarks).
  • Find places that are most frequently linked to a specific place by spatial relations (place relevance by co-occurrence).
  • Find the most frequent paths of length three, consisting of only directional relationships, i.e., place A-relation a->place B-relation b->place C (prominent routes)
  MATCH (p:place)-->(:place_reference)-[r {as: ’relatum’}]->(:n_plet),
  RETURN p, count(r) AS c
  ORDER BY c DESC
  MATCH (p:place)-[*2]->(n:n_plet)<--({place_reference: ’Alice Hoy’})<--(:place)
  RETURN p, count(n) AS c
  ORDER BY c DESC
  MATCH path=(p1:place)-[*2]->(n1:n_plet)<-[*2]-(p2:place)-[*2]->(n2:n_plet)<-[*2]-(p3:place),
  (n1)-->(r1:spatial_relation), (n2)-->(r2:spatial_relation), (n1)-->(:description)<--(n2)
  WHERE r1.family in [’cardinal_direction’, ’relative_direction’] and
  r2.family in [’cardinal_direction’, ’relative_direction’]
   RETURN p1, r1, p2, r2, p3, count(path) AS c
  ORDER BY c DESC

5. Results and Discussion

This section discusses the results of the three experiments. In Experiment I, places without gazetteered references have been georeferenced using five methods, namely baseline (without using any refinement approaches), SC (applying only the spatial context separation refinement), RF (applying only the reference direction refinement), CS (applying only the contrast set refinement), and Hybrid (all of the three refinements are applied). Refinement methods can only reduce the size of the ALR derived for a place. Thus, for each of the latter four refined georeferencing methods, four results are possible when georeferencing a place:
  • The size of the ALR is reduced compared to the one from the baseline, but both ALRs capture the ground-truth location of the place (Case 1).
  • There is no change in the ALR’s size (Case 2).
  • The ground-truth location is not captured in the either ALR (Case 3).
  • The ground-truth location is captured by the ALR of the baseline method, but not in the reduced-size ALR (Case 4).
Figure 13 shows the percentages of places that belong to each of the four cases, grouped by the four georeferencing methods. Places from Case 1 are regarded as better-georeferenced, while places from Case 2 and 3 are considered as equally georeferenced, and those from Case 3 are regarded as worse-georeferenced, when compared to the baseline. Among the first three methods, SC has the largest proportion of better georeferenced places, while for the RF and CS methods the percentages are much lower. In order to get refined ALRs for the latter two methods, relative direction relationships to some anchor places with reference direction information (for the RF method), or anchor places in the discourse as members of a contrast set (for the CS method) must be available. Since that is not always the case, only part of the places to be georeferenced can benefit from these two refinement methods.
When applying RF and CS methods, some places are worse-georeferenced (Case 4). For the RF method, this is because some relative direction relation information is incorrect, either due to mistakes made by descriptors, or the imperfection of the reference direction annotation procedure. For the CS method, the worse-georeferenced cases are because some ALRs are over-refined and, thus, not capturing the ground-truth locations of places any longer.
The hybrid method has overall the largest improvement in terms of the proportion of better-georeferenced places. This is expected as the method requires only information that can be used in any one of the three previous methods in order to make refinements. One drawback is that the hybrid method also has the largest worse-georeferenced place numbers due to error propagation. It is a trade-off problem between sizes of ALRs (the smaller the better as being more constraining) and having ALRs capturing the ground-truth location of places. A measure of the refinement in terms reduced ALRs is depicted in Figure 14, which shows in percent the ALR remaining size after refinement compared to the baseline, for individual places. A value of 0.6, for instance, means the refined ALR is 60% of the size it was in the baseline method. Only places with the available required information for refinement are included in the figure. For example, for the RF method, only places with relative direction relationships and reference direction information to some anchor places are included. The hybrid method results in the most size reduced ALRs for all places, which is also expected, given the method uses all the refining information available, combining the restrictions of the other three methods.
Places to be georeferenced are matched to gazetteer entries that fall within their derived ALR, based on the method described in [66] considering string, semantic, and spatial similarity. For example, if there is a gazetteer entry named ‘University Square’ within the ALR derived for the place reference ‘the large square’, the two place references are likely to be matched. The distance errors between ground-truth and matched gazetteer locations for individual places are shown in Figure 15 for the baseline and the Hybrid method, sorted by error size in the baseline. Large distance errors in the baseline seem to be more likely to be reduced by the hybrid refinement method. A possible explanation is that large distance errors usually correspond to large, less restricting ALRs, and in such situations a refinement is more effective. On the other hand, if an ALR derived in the baseline is already constraining, further refinement might have no, or even negative effects. For example, the peak on the left side of the axis represents a place that was correctly linked to its corresponding gazetteer entry in the baseline. In contrast, the refined ALR leaves out the ground-truth location, and causes the place to be miss-matched, resulting in an increased distance error.
In Experiment II, directional relations in the extended place graph are checked for relational inconsistency using Algorithm 1. Among the 43 directional relations (out of 726 in total) stored in n-plets, four were identified as inconsistent with other relationships stored in the database. As an example, consider the following two descriptions of the environment in Figure 16:
“… You’re now in the Old Quad … Pass through the Old Arts building and immediately look to your left—the tall building is the Babel building that, somewhat ironically, houses the languages and linguistics departments …”
“… From the Old Quad, you can go through the Old Arts building, and then turn right and walk until you come to a building called the Babel building (a 1970s yellow brick monolith) …”
The two relationships between the Babel building and the Old Arts Building from the two different descriptions are denoted here as ‘Babel left of Old Arts with Old Quad as back’ and ‘Babel right of Old Arts with Old Quad as back’, respectively. The first description is not true as can be verified from the map in Figure 16. The algorithm successfully identified the two relationships as being inconsistent. Note that this reasoning mechanism only flags inconsistent relationship pairs, instead of deciding which one is true.
Table 2 shows results generated for the three selected queries, in Experiment III. The second column ranks the most frequent relata in the test dataset, which can be regarded as local landmarks considering their prominence in the descriptions. In the original place graph, landmarks can only be identified by unique relationships, as the number of occurrences of neither place references, nor instances of the same spatial relation is preserved. For example, the relationship <Old Arts, right, Baillieu Library> was described more than ten times in the dataset. The original place graph stores the relationship once, which from a knowledge base perspective leads to loss of information about frequently mentioned, prominent relationships.
The third column shows places that are most frequently linked to Alice Hoy, a place from the test place graph being used here as an example. Again, such co-occurrence knowledge is not captured in the original place graph. The last column shows the top five most frequent length-3 paths, which are actually all from route descriptions.

6. Conclusions

Place descriptions occur in everyday communication as a way of conveying spatial information about place. This research proposes a graph-based approach for modelling and utilizing information from place descriptions, including references to places, spatial relationships, place semantics and various contextual knowledge. The model is a revision and an extension of the original place graph [1]. We implement the model in a graph database and demonstrate its superiority over the original place graph in three experiments, namely georeferencing, reasoning, and querying. Place graphs are regarded as place-based knowledge bases that have been used in tasks such as landmark identification and sketch-map drawing. However, place graphs constructed from triplets stripped off of context and other useful spatial or non-spatial information have restricted applications. This research proposes an extended model in order to overcome those limitations.
We identified eight types of information that are embedded in place descriptions and not captured in the original place graph model: place semantics and characteristics, places and spatial relations from the same discourse, as well as their sequential order of appearance, reference frame, non-binary relationships, co-occurrence of place references and spatial relations, place conceptualization, route and accessibility, and description context and source context. We then designed a graph database schema for modelling these types of information, allowing convenient and efficient query through graph traversal. We implemented the schema using a commercial graph database platform for three experiments, and developed a management system interface capable of multiple place graph operations.
To demonstrate the superiority of the new model, three experiments were performed on an extended place graph created from descriptions about the University of Melbourne campus. For the first georeferencing task, we propose three refinement methods as well as a hybrid method to reduce the areas of the approximate location regions generated for these places. When compared to the baseline method using the original place graph model, Figure 13 shows that more than 60% of the derived regions are reduced in size with the hybrid method, based on information newly captured in the extended graph. The reduced sizes as percentages as well as reduced distance errors for individual places are presented in Figure 14 and Figure 15, respectively. The second experiment leverages reference direction information for identifying contradicting directional knowledge in an extended place graph. The results show that the approach can identify inconsistent relationships based on both qualitative and quantitative data. In the third experiment, we demonstrate that the extended place graph is capable of answering queries that cannot be answered by an original place graph, and the results for sample queries are presented in Table 2.
The extensions of this paper were identified by our own needs for richer reasoning. We anticipate that additional future extensions are possible, since each step in the process is extensible. Thus, in future work, additional information such as place semantics could be introduced, which enriches ALR refinement and defining search spaces for non-binary relationships. Furthermore, query expansion (e.g., based on place semantic similarity and relational inference) and relevance-ranking (e.g., based on occurrence and description source) mechanisms can be further developed. Some of the identified information, however, requires new techniques or parsers to obtain this automatically, which are not readily available.
Although natural language processing is beyond the scope of this research, it is necessary for the automation of extracting several types of information identified in Section 3.1. Spatial language, although comprehensively studied in linguistic and spatial cognition, is relatively ignored in the community of natural language processing. Machine learning seems promising for spatial language processing and has been used previously for extracting place references as well as spatial relationships. For example, it can be used for extracting motion verbs indicating routes, phrases indicating human activities at places, or expressions of non-binary relationships. Techniques being able to perform information extraction (IE) and part-of-speech (POS) tagging are preferable for these types of problems, such as hidden Markov models (HMM) or conditional random fields (CRF). It is also promising to apply deep learning, which has pushed the state-of-the-art of many machine learning problems, with neural network models that were used for IE and POS tagging problems before, such as recurrent neural networks (RNN) or long short-term memory (LSTM).
The extended place graph model provides a complementary knowledge base to traditional GIS. A place-based information system can be used in applications requiring human spatial and non-spatial knowledge about places. The implemented extended place graph management system supports cypher queries and returns tabular results with high interoperability. Finally, the knowledge modelled in extended place graphs can help better understand human descriptions as input to spatial services and, thus, support intuitive and smooth human-computer interaction.

Author Contributions

Conceptualization: H.C., M.V., S.W. and M.T.; Methodology, Implementation and Experiment: H.C.; Writing—Original Draft Preparation: H.C.; Writing—Review & Editing: M.V., S.W. and M.T.

Funding

The supports by the Australian Research Council grant DP170100109, Melbourne International Fee Remission Scholarship (MIFRS), and Melbourne International Research Scholarship (MIRS) are acknowledged.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Graph Construction Example

In the example description below, references to places and spatial relationship expressions are highlighted using bold and italic font receptively. Other non-spatial information, such as place equipment and place characteristics, is highlighted by underlines. The example illustrates all types of information identified in Section 3.1.
South Lawn is the major reference point which is situated in about the middle of the campus. Coming from the Main South Entry, the Baillieu Library will be on the left hand side of the South Lawn. To the north of this you have the Old Quad (really old English style building). If you want food and are currently on South Lawn go through the Old Quad to the north and keeping heading north until you get to a Union House.”
To construct an extended place graph, the description will first be parsed into a JSON file, as shown below:
          {"descriptions": [
              {"did": 1,
              "theme": ["University of Melbourne campus description"],
              "transportation␣mode": "walking",
              "source": "collected from students",
		   
              "descriptor":
                  {"uid": 1,
                  "identity": "university student"},
              "recipient":
                  {"uid": 2,
                  "identity": "university student"},
		   
              "n␣plets":
                  [{"nid": 1,
                  "locatum␣reference": "South Lawn",
                  "locatum␣characteristic": "the major reference point",
                  "locatum␣conceptualization": "node",
                  "spatial␣relation␣expression": "about the middle of",
                  "relatum␣reference": "campus",
                  "relatum␣conceptualization": "district",
                  "relation␣map": ["inside"]},
		   
                  {"nid": 2,
                  "locatum␣reference": "Baillieu Library",
                  "locatum␣conceptualization": "node",
                  "spatial␣relation␣expression": "on the left hand side",
                  "relatum␣reference": "South Lawn",
                  "relatum␣conceptualization": "node",
                  "reference␣frame": "relative",
                  "reference␣direction": ["Main South Entry", "back"],
                  "relation␣map": ["left"]},
		   
                  {"nid": 3,
                  "locatum␣reference": "Old Quad",
                  "locatum␣type": "building",
                  "locatum␣characteristic": ["old", "English style"],
                  "locatum␣conceptualization": "node",
                  "spatial␣relation␣expression": "to the north",
                  "relatum␣reference": "South Lawn",
                  "relatum␣conceptualization": "node",
                  "relation␣map": ["north"]},
		   
                  {"nid": 4,
                  "route_id": 1,
                  "locatum␣reference": "Union House",
                  "locatum␣equipment": "food",
                  "locatum␣conceptualization": "node",
                  "spatial␣relation␣expression": "go through",
                  "relatum␣reference": ["Old Quad", "South Lawn"],
                  "relatum␣conceptualization": ["node", "node"],
                  "relation␣map": ["through"]},
		   
                  {"nid": 5,
                  "route_id": 1,
                  "locatum␣reference": "Union House",
                  "locatum␣conceptualization": "node",
                  "spatial␣relation␣expression": "to the north and keeping heading north",
                  "relatum␣reference": "Old Quad",
                  "relatum␣conceptualization": "node",
                  "relation␣map": ["north"]}]
              }
          ]}
       
As there is only one description, a description node with two user nodes will be created, as shown in Figure A1.
Then, five n-plet nodes will be created and linked by from edges to the description node, as shown in Figure A2.
Next, place reference nodes, mapped spatial relation nodes, and a route node (if the n-plet is part of a route) will be created and linked to each n-plet node, as shown in Figure A3.
Figure A1. Creation of description and user nodes.
Figure A1. Creation of description and user nodes.
Ijgi 07 00221 g0a1
Figure A2. Creation of n-plet nodes.
Figure A2. Creation of n-plet nodes.
Ijgi 07 00221 g0a2
Figure A3. Creation of place reference, spatial relation, and route nodes.
Figure A3. Creation of place reference, spatial relation, and route nodes.
Ijgi 07 00221 g0a3
Last, a place node will be created for each place reference node. The node merging approach mentioned previously can then link places that are referred by different place references from different n-plet, resulting in multiple place reference nodes linked to the same place node, as shown in Figure A4.
Figure A4. Linking different place reference nodes to the same place node through node merging.
Figure A4. Linking different place reference nodes to the same place node through node merging.
Ijgi 07 00221 g0a4
Figure A5 shows the resulting extended place graph of the whole place description generated using Neo4j.
Figure A5. The resulting extended place graph database of the place description example. Gray: description; Pink: user; Green: n-plet; Blue: (mapped) spatial relation; Red: route; Yellow: place reference; Purple: place.
Figure A5. The resulting extended place graph database of the place description example. Gray: description; Pink: user; Green: n-plet; Blue: (mapped) spatial relation; Red: route; Yellow: place reference; Purple: place.
Ijgi 07 00221 g0a5
References

References

  1. Vasardani, M.; Timpf, S.; Winter, S.; Tomko, M. From descriptions to depictions: A conceptual framework. In Spatial Information Theory; Lecture Notes in Computer Science; Tenbrink, T., Stell, J.G., Galton, A., Wood, Z., Eds.; Springer: Berlin, Germany, 2013; Volume 8116, pp. 299–319. [Google Scholar]
  2. Elwood, S.; Goodchild, M.F.; Sui, D. Prospects for VGI research and the emerging fourth paradigm. In Crowdsourcing Geographic Knowledge; Springer: Berlin, Germany, 2013; pp. 361–375. [Google Scholar]
  3. Lieberman, M.D.; Samet, H.; Sankaranarayanan, J.; Sperling, J. STEWARD: Architecture of a spatio-textual search engine. In Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, 7–9 November 2007; Samet, H., Shahabi, C., Schneider, M., Eds.; ACM: New York, NY, USA, 2007; pp. 186–193. [Google Scholar]
  4. Moncla, L.; Renteria-Agualimpia, W.; Nogueras-Iso, J.; Gaio, M. Geocoding for texts with fine-grain toponyms: an experiment on a geoparsed hiking descriptions corpus. In Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Dallas, TX, USA, 4–7 November 2014; Huang, Y., Schneider, M., Gertz, M., Krumm, J., Sankaranarayanan, J., Eds.; ACM: New York, NY, USA, 2014; pp. 183–192. [Google Scholar]
  5. DeLozier, G.; Baldridge, J.; London, L. Gazetteer-Independent Toponym Resolution Using Geographic Word Profiles. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015; pp. 2382–2388. [Google Scholar]
  6. Kordjamshidi, P.; Van Otterlo, M.; Moens, M.F. Spatial role labelling: Towards extraction of spatial relations from natural language. ACM Trans. Speech Lang. Process. 2011, 8, 4. [Google Scholar] [CrossRef]
  7. Khan, A.; Vasardani, M.; Winter, S. Extracting Spatial Information From Place Descriptions. In Proceedings of the ACM SIGSPATIAL Workshop on Computational Models of Place 2013, Orlando, FL, USA, 5–8 November 2013; Scheider, S., Adams, B., Janowicz, K., Vasardani, M., Winter, S., Eds.; ACM: New York, NY, USA, 2013; pp. 62–69. [Google Scholar]
  8. Liu, F.; Vasardani, M.; Baldwin, T. Automatic Identification of Locative Expressions from Social Media Text: A Comparative Analysis. In Proceedings of the 4th International Workshop on Location and the Web, Shanghai, China, 3 November 2014; Ahlers, D., Wilde, E., Martins, B., Eds.; ACM: New York, NY, USA, 2014; pp. 9–16. [Google Scholar]
  9. Bateman, J.A.; Hois, J.; Ross, R.; Tenbrink, T. A linguistic ontology of space for natural language processing. Artif. Intell. 2010, 174, 1027–1071. [Google Scholar] [CrossRef]
  10. Mani, I.; Doran, C.; Harris, D.; Hitzeman, J.; Quimby, R.; Richer, J.; Wellner, B.; Mardis, S.; Clancy, S. SpatialML: Annotation scheme, resources, and evaluation. Lang. Resour. Eval. 2010, 44, 263–280. [Google Scholar] [CrossRef]
  11. Kim, J.; Vasardani, M.; Winter, S. Similarity matching for integrating spatial information extracted from place descriptions. Int. J. Geogr. Inf. Sci. 2016, 1, 1–25. [Google Scholar] [CrossRef]
  12. Kim, J.; Vasardani, M.; Winter, S. Landmark Extraction from Web-Harvested Place Descriptions. KI-Künstliche Intelligenz 2016, 2, 151–159. [Google Scholar] [CrossRef]
  13. Goodchild, M.F. Citizens as sensors: The world of volunteered geography. GeoJournal 2007, 69, 211–221. [Google Scholar] [CrossRef]
  14. Goodchild, M.F. Formalizing place in geographical information systems. In Communities, Neighborhoods, and Health: Expanding the Boundaries of Place; Burton, L.M., Kemp, S.P., Leung, M.C., Matthews, S.A., Takeuchi, D.T., Eds.; Springer: New York, NY, USA, 2011; pp. 21–35. [Google Scholar]
  15. Winter, S.; Baldwin, T.; Renz, J.; Tomko, M.; Kuhn, W. Place knowledge as a trans-disciplinary research challenge for Geographic Information Science. In Proceedings of the UCGIS Symposium, Scottsdale, AZ, USA, 24–26 May 2016. [Google Scholar]
  16. Egenhofer, M.J.; Mark, D.M. Naive geography. In Proceedings of the International Conference on Spatial Information Theory, Semmering, Austria, 21–23 September 1995; Springer: Berlin/Heidelberg, Germany, 1995; pp. 1–15. [Google Scholar]
  17. Goodchild, M.F. Formalizing place in geographic information systems. In Communities, Neighborhoods, and Health; Springer: New York, NY, USA, 2011; pp. 21–33. [Google Scholar]
  18. Relph, E. Place and Placelessness; Pion: London, UK, 1976. [Google Scholar]
  19. Winter, S.; Bennett, R.; Truelove, M.; Rajabifard, A.; Duckham, M.; Kealy, A.; Leach, J. Spatially enabling ‘Place’ information. In Spatially Enabling Society: Research, Emerging Trends, and Critical Assessment; Rajabifard, A., Ed.; GSDI Association: Melbourne, Australia, 2010. [Google Scholar]
  20. Bennett, B.; Agarwal, P. Semantic Categories Underlying the Meaning of ‘Place’. In Lecture Notes in Computer Science, Proceedings of the Conference on Spatial Information Theory, Melbourne, Australia, 19–23 September 2007; Winter, S., Duckham, M., Kulik, L., Kuipers, B., Eds.; Springer: Berlin, Germany, 2007; Volume 4736, pp. 78–95. [Google Scholar]
  21. Winter, S.; Freksa, C. Approaching the notion of place by contrast. J. Spat. Inf. Sci. 2012, 5, 31–50. [Google Scholar] [CrossRef]
  22. Tuan, Y.F. Space and Place: The Perspective of Experience; University of Minnesota Press: Minneapolis, MN, USA, 1977. [Google Scholar]
  23. Cresswell, T. Place: An Introduction; John Wiley & Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
  24. Agarwal, P. Operationalising Sense of Place’as a Cognitive Operator for Semantics in Place-Based Ontologies. In Proceedings of the Conference on Spatial Information Theory, Ellicottville, NY, USA, 14–18 September 2005; Springer: Berlin/Heidelberg, Germany, 2005; Volume 3693, pp. 96–114. [Google Scholar]
  25. Curry, M.R. The Work in the World: Geographical Practice and the Written Word; University of Minnesota Press: Minneapolis, MN, USA, 1996. [Google Scholar]
  26. Agnew, J. Space and place. Handb. Geogr. Knowl. 2011, 2011, 316–331. [Google Scholar]
  27. Vasardani, M.; Winter, S.; Richter, K.F. Locating place names from place descriptions. Int. J. Geogr. Inf. Sci. 2013, 27, 2509–2532. [Google Scholar] [CrossRef]
  28. Adams, B.; McKenzie, G. Inferring thematic places from spatially referenced natural language descriptions. In Crowdsourcing Geographic Knowledge; Springer: Berlin, Germany, 2013; pp. 201–221. [Google Scholar]
  29. Hill, L.L. Core elements of digital gazetteers: Placenames, categories, and footprints. In Research and Advanced Technology for Digital Libraries; Lecture Notes in Computer Science; Borbinha, J.L., Baker, T., Eds.; Springer: Berlin, Germany, 2000; Volume 1923, pp. 280–290. [Google Scholar]
  30. Goodchild, M.F.; Hill, L.L. Introduction to digital gazetteer research. Int. J. Geogr. Inf. Sci. 2008, 22, 1039–1044. [Google Scholar] [CrossRef] [Green Version]
  31. Silva, M.J.; Martins, B.; Chaves, M.; Afonso, A.P.; Cardoso, N. Adding geographic scopes to web resources. Comput. Environ. Urban Syst. 2006, 30, 378–399. [Google Scholar] [CrossRef] [Green Version]
  32. Purves, R.S.; Clough, P.; Jones, C.B.; Arampatzis, A.; Bucher, B.; Finch, D.; Fu, G.; Joho, H.; Syed, A.K.; Vaid, S.; et al. The design and implementation of SPIRIT: A spatially aware search engine for information retrieval on the Internet. Int. J. Geogr. Inf. Sci. 2007, 21, 717–745. [Google Scholar] [CrossRef]
  33. Jones, C.B.; Purves, R.S. Geographical information retrieval. Int. J. Geogr. Inf. Sci. 2008, 22, 219–228. [Google Scholar] [CrossRef] [Green Version]
  34. Cohn, A.G.; Gotts, N.M. The ‘egg-yolk’representation of regions with indeterminate boundaries. In Geographic Objects with Indeterminate Boundaries; Taylor & Francis: London, UK, 1996; Volume 2, pp. 171–187. [Google Scholar]
  35. Fisher, P. Boolean and fuzzy regions. In Geographic Objects with Indeterminate Boundaries; Taylor & Francis: London, UK, 1996; Volume 2. [Google Scholar]
  36. Bittner, T.; Stell, J.G. Rough sets in approximate spatial reasoning. In Proceedings of the International Conference on Rough Sets and Current Trends in Computing, Banff, AB, Canada, 16–19 October 2000; Springer: Berlin, Germany, 2000; pp. 445–453. [Google Scholar]
  37. Clementini, E.; Di Felice, P. An algebraic model for spatial objects with indeterminate boundaries. In Geographic Objects with Indeterminate Boundaries; Taylor & Francis: London, UK, 1996; Volume 2, pp. 155–169. [Google Scholar]
  38. Kulik, L. A geometric theory of vague boundaries based on supervaluation. In Proceedings of the International Conference on Spatial Information Theory, Morro Bay, CA, USA, 19–23 September 2001; Springer: Berlin, Germany, 2001; pp. 44–59. [Google Scholar]
  39. Wieczorek, J.; Guo, Q.; Hijmans, R. The point-radius method for georeferencing locality descriptions and calculating associated uncertainty. Int. J. Geogr. Inf. Sci. 2004, 18, 745–767. [Google Scholar] [CrossRef] [Green Version]
  40. Guo, Q.; Liu, Y.; Wieczorek, J. Georeferencing locality descriptions and computing associated uncertainty using a probabilistic approach. Int. J. Geogr. Inf. Sci. 2008, 22, 1067–1090. [Google Scholar] [CrossRef]
  41. Liu, Y.; Guo, Q.H.; Wieczorek, J.; Goodchild, M.F. Positioning localities based on spatial assertions. Int. J. Geogr. Inf. Sci. 2009, 23, 1471–1501. [Google Scholar] [CrossRef]
  42. Montello, D.R.; Goodchild, M.F.; Gottsegen, J.; Fohl, P. Where’s down town?: Behavioral methods for determining referents of vague spatial queries. Spat. Cognit. Comput. 2003, 3, 185–204. [Google Scholar]
  43. Jones, C.B.; Purves, R.S.; Clough, P.D.; Joho, H. Modelling vague places with knowledge from the Web. Int. J. Geogr. Inf. Sci. 2008, 22, 1045–1065. [Google Scholar] [CrossRef] [Green Version]
  44. Grothe, C.; Schaab, J. Automated footprint generation from geotags with kernel density estimation and support vector machines. Spat. Cognit. Comput. 2009, 9, 195–211. [Google Scholar] [CrossRef]
  45. Hollenstein, L.; Purves, R. Exploring place through user-generated content: Using Flickr tags to describe city cores. J. Spat. Inf. Sci. 2010, 2010, 21–48. [Google Scholar]
  46. Gao, S.; Janowicz, K.; Montello, D.R.; Hu, Y.; Yang, J.A.; McKenzie, G.; Ju, Y.; Gong, L.; Adams, B.; Yan, B. A data-synthesis-driven method for detecting and extracting vague cognitive regions. Int. J. Geogr. Inf. Sci. 2017, 31, 1245–1271. [Google Scholar] [CrossRef]
  47. Speriosu, M.; Brown, T.; Moon, T.; Baldridge, J.; Erk, K. Connecting language and geography with region-topic models. In Proceedings of the Workshop on Computational Models of Spatial Language Interpretation (COSLI), Portland, OR, USA, 15 August 2010. [Google Scholar]
  48. Adams, B.; Janowicz, K. Thematic signatures for cleansing and enriching place-related linked data. Int. J. Geogr. Inf. Sci. 2015, 29, 556–579. [Google Scholar] [CrossRef]
  49. Twigger-Ross, C.L.; Uzzell, D.L. Place and identity processes. J. Environ. Psychol. 1996, 16, 205–220. [Google Scholar] [CrossRef]
  50. Vasardani, M.; Stirling, L.F.; Winter, S. The preposition at from a spatial language, cognition, and information systems perspective. Sema. Pragmat. 2017, 10. [Google Scholar] [CrossRef] [Green Version]
  51. Jordan, T.; Raubal, M.; Gartrell, B.; Egenhofer, M. An affordance-based model of place in GIS. In Proceedings of the 8th International Symposium on Spatial Data Handling, Vancouver, BC, Canada, 11–15 July 1998; Volume 98, pp. 98–109. [Google Scholar]
  52. Kuhn, W. Ontologies in support of activities in geographical space. Int. J. Geogr. Inf. Sci. 2001, 15, 613–631. [Google Scholar] [CrossRef] [Green Version]
  53. Scheider, S.; Kuhn, W. Affordance-based categorization of road network data using a grounded theory of channel networks. Int. J. Geogr. Inf. Sci. 2010, 24, 1249–1267. [Google Scholar] [CrossRef]
  54. Janowicz, K.; Keßler, C. The role of ontology in improving gazetteer interaction. Int. J. Geogr. Inf. Sci. 2008, 22, 1129–1157. [Google Scholar] [CrossRef] [Green Version]
  55. Jarvella, R.J.; Klein, W. Speech, Place, and Action: Studies of Deixis and Related Topics; John Wiley & Sons: Hoboken, NJ, USA, 1982. [Google Scholar]
  56. Couclelis, H.; Golledge, R.G.; Gale, N.; Tobler, W. Exploring the anchor-point hypothesis of spatial cognition. J. Environ. Psychol. 1987, 7, 99–122. [Google Scholar] [CrossRef]
  57. Richter, D.; Winter, S.; Richter, K.F.; Stirling, L. Granularity of locations referred to by place descriptions. Comput. Environ. Urban Syst. 2013, 41, 88–99. [Google Scholar] [CrossRef]
  58. Pustejovsky, J. Iso-space: Annotating static and dynamic spatial information. In Handbook of Linguistic Annotation; Springer: Berlin, Germany, 2017; pp. 989–1024. [Google Scholar]
  59. Coyne, B.; Sproat, R.; Hirschberg, J. Spatial relations in text-to-scene conversion. In Proceedings of the Computational Models of Spatial Language Interpretation, Workshop at Spatial Cognition, Portland, OR, USA, 15–19 August 2010. [Google Scholar]
  60. Kordjamshidi, P.; Frasconi, P.; Van Otterlo, M.; Moens, M.F.; De Raedt, L. Relational learning for spatial relation extraction from natural language. In Proceedings of the International Conference on Inductive Logic Programming, Windsor, UK, 31 July–3 August 2011; Springer: Berlin, Germany, 2011; pp. 204–220. [Google Scholar]
  61. Langacker, R.W. Foundations of Cognitive Grammar: Theoretical Prerequisites; Stanford University Press: Redwood City, CA, USA, 1987; Volume 1. [Google Scholar]
  62. Lakoff, G. Women, fire, and Dangerous Things; University of Chicago Press: Chicago, IL, USA, 2008. [Google Scholar]
  63. Talmy, L. How language structures space. In Spatial Orientation; Springer: Berlin, Germany, 1983; pp. 225–282. [Google Scholar]
  64. Levinson, S.C. Frames of reference and Molyneux’s question: Crosslinguistic evidence. Lang. Space 1996, 109, 169. [Google Scholar]
  65. Vandeloise, C. Spatial pRepositions: A Case Study from French; University of Chicago Press: Chicago, IL, USA, 1991. [Google Scholar]
  66. Chen, H.; Vasardani, M.; Winter, S. Georeferencing places from collective human descriptions using place graphs. J. Spat. Inf. Sci. 2018. Available online: http://www.josis.org/index.php/josis/article/viewFile/417/214 (accessed on 15 June 2018).
  67. Kim, J.; Vasardani, M.; Winter, S. From descriptions to depictions: A dynamic sketch map drawing strategy. Spat. Cognit. Comput. 2016, 16, 29–53. [Google Scholar] [CrossRef]
  68. Wolter, D.; Yousaf, M. Context and Vagueness in Automated Interpretation of Place Description: A Computational Model. In Proceedings of the International Conference on Spatial Information Theory, L’Aquila, Italy, 4–8 September 2017; Springer: Berlin, Germany, 2017; pp. 137–142. [Google Scholar]
  69. Scheider, S.; Janowicz, K. Place reference systems. Appl. Ontol. 2014, 9, 97–127. [Google Scholar]
  70. Alazzawi, A.N.; Abdelmoty, A.I.; Jones, C.B. What can I do there? Towards the automatic discovery of place-related services and activities. Int. J. Geogr. Inf. Sci. 2012, 26, 345–364. [Google Scholar] [CrossRef] [Green Version]
  71. Janowicz, K. Sim-DL: Towards a Semantic Similarity Measurement Theory for the Description Logic ALCNR\mathcal ALCNR in Geographic Information Retrieval. In Proceedings of the on the Move to Meaningful Internet Systems 2006: OTM 2006 Workshops, Montpellier, France, 29 October–3 November 2006; Springer: Berlin, Germany, 2006; pp. 1681–1692. [Google Scholar]
  72. Hobel, H.; Fogliaroni, P. Extracting semantics of places from user generated content. In Proceedings of the 19th AGILE International Conference on Geographic Information Science, Atlanta, GA, USA, 25–29 July 2016. [Google Scholar]
  73. Tenbrink, T. Reference frames of space and time in language. J. Pragmat. 2011, 43, 704–722. [Google Scholar] [CrossRef]
  74. Wallgrün, J.O.; Frommberger, L.; Wolter, D.; Dylla, F.; Freksa, C. Qualitative spatial representation and reasoning in the SparQ-toolbox. In Proceedings of the International Conference on Spatial Cognition, Bremen, Germany, 24–28 September 2006; Springer: Berlin, Germany, 2006; Volume 4387, pp. 39–58. [Google Scholar]
  75. Cohn, A.G.; Renz, J. Qualitative spatial representation and reasoning. Found. Artif. Intell. 2008, 3, 551–596. [Google Scholar]
  76. Lynch, K. The Image of the City; MIT Press: Cambridge, MA, USA, 1960; Volume 11. [Google Scholar]
  77. Tomko, M.; Winter, S. Describing the Functional Spatial Structure of Urban Environments. Comput. Environ. Urban Syst. 2013, 41, 177–187. [Google Scholar] [CrossRef]
  78. Moncla, L.; Gaio, M.; Nogueras-Iso, J.; Mustière, S. Reconstruction of itineraries from annotated text with an informed spanning tree algorithm. Int. J. Geogr. Inf. Sci. 2016, 30, 1137–1160. [Google Scholar] [CrossRef]
  79. Yao, X.; Thill, J.C. How Far Is Too Far?—A Statistical Approach to Context-contingent Proximity Modeling. Trans. GIS 2005, 9, 157–178. [Google Scholar] [CrossRef]
  80. Richter, D.; Winter, S.; Richter, K.F.; Stirling, L. How people describe their place: Identifying predominant types of place descriptions. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, Redondo Beach, CA, USA, 6 November 2012; pp. 30–37. [Google Scholar]
  81. Kuhn, W. Core concepts of spatial information for transdisciplinary research. Int. J. Geogr. Inf. Sci. 2012, 26, 2267–2276. [Google Scholar] [CrossRef]
Figure 1. Place graph consisting of two example triplets: <courtyard, on, campus> and <courtyard, beside, clocktower>.
Figure 1. Place graph consisting of two example triplets: <courtyard, on, campus> and <courtyard, beside, clocktower>.
Ijgi 07 00221 g001
Figure 2. Parts of a merged place graph constructed from descriptions of the University of Melbourne campus, with node size corresponding to node degree, and edge size corresponding to number of relationships between the linked nodes. Multiple relationships between two nodes are represented by only one edge.
Figure 2. Parts of a merged place graph constructed from descriptions of the University of Melbourne campus, with node size corresponding to node degree, and edge size corresponding to number of relationships between the linked nodes. Multiple relationships between two nodes are represented by only one edge.
Ijgi 07 00221 g002
Figure 3. UML of the original place graph model.
Figure 3. UML of the original place graph model.
Ijgi 07 00221 g003
Figure 4. UML diagram illustrating the extended place graph database model, with seven types of classes (nodes) and nine types of relationships (edges).
Figure 4. UML diagram illustrating the extended place graph database model, with seven types of classes (nodes) and nine types of relationships (edges).
Ijgi 07 00221 g004
Figure 5. An example of modelling a relative direction relationship using the extended place graph.
Figure 5. An example of modelling a relative direction relationship using the extended place graph.
Ijgi 07 00221 g005
Figure 6. The implemented extended place graph database management system interface, with an example visualization of part of the test extended place graph. A clearer example of an extended place graph as well as explanations is provided in the Appendix A.
Figure 6. The implemented extended place graph database management system interface, with an example visualization of part of the test extended place graph. A clearer example of an extended place graph as well as explanations is provided in the Appendix A.
Ijgi 07 00221 g006
Figure 7. An example of deriving the ALR for place b through intersection of search spaces.
Figure 7. An example of deriving the ALR for place b through intersection of search spaces.
Ijgi 07 00221 g007
Figure 8. The spatial context of a merged, original place graph (left), and separated spatial contexts of an extended place graph (right).
Figure 8. The spatial context of a merged, original place graph (left), and separated spatial contexts of an extended place graph (right).
Ijgi 07 00221 g008
Figure 9. Search space of place B for relationship <B, right of, A> without a reference direction (left) compared to with anchored reference direction information (right).
Figure 9. Search space of place B for relationship <B, right of, A> without a reference direction (left) compared to with anchored reference direction information (right).
Ijgi 07 00221 g009
Figure 10. The search space of a qualitative distance relationship with contrast set information, represented by the shaded region.
Figure 10. The search space of a qualitative distance relationship with contrast set information, represented by the shaded region.
Ijgi 07 00221 g010
Figure 11. Determining consistency of directional relationships between a locatum B and a relatum A by search spaces. (a) an existing relationship and its search space; (b) another relationship that is consistent with the existing one, determined by search space; (c) another relationship that is inconsistent with the previous two.
Figure 11. Determining consistency of directional relationships between a locatum B and a relatum A by search spaces. (a) an existing relationship and its search space; (b) another relationship that is consistent with the existing one, determined by search space; (c) another relationship that is inconsistent with the previous two.
Ijgi 07 00221 g011
Figure 12. Consistency reasoning through reference direction translation. (a) an existing relationship; (b) another new relationship that is consistent with the existing one, determined by reference direction translation; (c) another relationship that is inconsistent with the previous two.
Figure 12. Consistency reasoning through reference direction translation. (a) an existing relationship; (b) another new relationship that is consistent with the existing one, determined by reference direction translation; (c) another relationship that is inconsistent with the previous two.
Ijgi 07 00221 g012
Figure 13. Percentages of places from different ALR refinement situations compared to baseline.
Figure 13. Percentages of places from different ALR refinement situations compared to baseline.
Ijgi 07 00221 g013
Figure 14. Refined approximate location region (ALR) sizes as percentages of the original (baseline) ALR size using the SC, RF, CS, and Hybrid methods.
Figure 14. Refined approximate location region (ALR) sizes as percentages of the original (baseline) ALR size using the SC, RF, CS, and Hybrid methods.
Ijgi 07 00221 g014
Figure 15. Distance errors between ground-truth and matched gazetteer locations for the baseline and hybrid methods.
Figure 15. Distance errors between ground-truth and matched gazetteer locations for the baseline and hybrid methods.
Ijgi 07 00221 g015
Figure 16. The locations of the three places mentioned in the descriptions above, with a red arrow indicating the walking direction.
Figure 16. The locations of the three places mentioned in the descriptions above, with a red arrow indicating the walking direction.
Ijgi 07 00221 g016
Table 1. Formal binary spatial relations considered in this research.
Table 1. Formal binary spatial relations considered in this research.
Spatial Relation FamilySpatial Relation Type
Cardinal directionnorth, south, east, west, northeast, southeast, northwest, southwest
Qualitative distancenear
Relative directionfront, back, left, right, left front, right front, left back, right back
Topologicalinside, covered by, overlap, meet, disjoint, cover, contain, equal
Table 2. Results for the three queries from the third experiment, with the second query using the place Alice Hoy as an example.
Table 2. Results for the three queries from the third experiment, with the second query using the place Alice Hoy as an example.
RankMost Frequent RelataPlaces Most Frequently Co-Occurring with Alice HoyMost Frequent Length-3 Paths
1University Of MelbourneMonash Road<Old Arts, right, Baillieu Library, left, South Lawn>
2Union BuildingEntrance<Baillieu Library, left, South Lawn, front, John Medley>
3Grattan StreetUniversity of Melbourne<Royal Parade, left, Baillieu Library, left, South Lawn>
4South LawnWilson Hall<Medical Building, left, Baillieu Library, left, South Lawn>
5Swanston StreetPeter Hall Building<Baillieu Library, left, South Lawn, left, Wilson Hall>

Share and Cite

MDPI and ACS Style

Chen, H.; Vasardani, M.; Winter, S.; Tomko, M. A Graph Database Model for Knowledge Extracted from Place Descriptions. ISPRS Int. J. Geo-Inf. 2018, 7, 221. https://doi.org/10.3390/ijgi7060221

AMA Style

Chen H, Vasardani M, Winter S, Tomko M. A Graph Database Model for Knowledge Extracted from Place Descriptions. ISPRS International Journal of Geo-Information. 2018; 7(6):221. https://doi.org/10.3390/ijgi7060221

Chicago/Turabian Style

Chen, Hao, Maria Vasardani, Stephan Winter, and Martin Tomko. 2018. "A Graph Database Model for Knowledge Extracted from Place Descriptions" ISPRS International Journal of Geo-Information 7, no. 6: 221. https://doi.org/10.3390/ijgi7060221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop