Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation

Song, Jingbo; Yi, Qiuhua; Gao, Haoran; Wang, Buyu; Kong, Xiangjie

doi:10.3390/app13116495

Open AccessArticle

Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation

by

Jingbo Song

¹,

Qiuhua Yi

²

,

Haoran Gao

³,

Buyu Wang

⁴

and

Xiangjie Kong

^2,*

¹

School of Arts, Tourism College of Zhejiang, Hangzhou 311231, China

²

College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China

³

School of Software, Dalian University of Technology, Dalian 116620, China

⁴

College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot 010018, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(11), 6495; https://doi.org/10.3390/app13116495

Submission received: 25 April 2023 / Revised: 21 May 2023 / Accepted: 23 May 2023 / Published: 26 May 2023

(This article belongs to the Special Issue Advances in Artificial Intelligence (AI)-Driven Data Mining)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Point of interest (POI) recommendation is an important task in location-based social networks. It plays a critical role in smart tourism and makes it more likely for tourists to have personalized travel experiences. However, most current recommendation methods are based on learning the users’ check-in history and the users’ relationship network in the social network to make recommendations.Therefore, urban crowds’ regular travel patterns cannot be effectively utilized. In this paper, we propose a POI recommendation algorithm (HMRec) based on prior knowledge of human mobility patterns to solve this problem. Specifically, we propose the Human Mobility Pattern Extraction (HMPE) framework, which utilizes graph neural networks as extractors for human mobility patterns. The framework incorporates attention mechanisms to capture spatio-temporal information in urban traffic patterns. HMPE employs downstream tasks and design upsampling modules to reconstruct representation vectors for task objectives, enabling end-to-end training of the framework and obtaining pre-trained parameters for the human mobility pattern extractor. Furthermore, we introduce the Human Mobility Recommendation (HMRec) algorithm, which improves feature cross-interactions in the breadth model and incorporates prior knowledge of human patterns. This ensures that the recommendation results align more closely with human travel patterns in urban environments. Comparative experiments conducted on the Foursquare dataset demonstrate that HMRec outperforms baseline models with an average performance improvement of approximately 3%. Finally, we discuss existing challenges and future research directions, including approaches to address the issue of data sparsity.

Keywords:

POI recommendation; human mobility pattern; graph neural network

1. Introduction

Undoubtedly, smart tourism is an important component of smart cities. As tourist destinations continue to develop and change, people’s travel choices are becoming more diverse and complex. Smart tourism aims to intelligently connect and manage cities and tourism resources using technologies such as the Internet of Things (IoT), big data, and artificial intelligence in order to provide personalized tourism services to travelers [1]. In the context of deep learning technology, Point of Interest (POI) recommendation plays a crucial role in smart tourism. It can extract preference information from users’ historical travel data and provide them with greater convenience and satisfaction in their travel experiences. POI recommendation can also alleviate the current situation of over-tourism [2], reducing the burden and environmental pressure of tourism destinations.

On the other hand, the Internet of Things (IoT), as a network infrastructure, collects various needed information in real-time through various technologies, realizes the ubiquitous connection between things and other things and between things and people, and achieves intelligent perception of objects and processes [3,4]. Through sensors and devices in the Internet of Things (IoT), environmental information, foot traffic, traffic conditions, and other data surrounding Points of Interest (POIs) can be collected and monitored. These dynamic and heterogeneous data can be used for real-time monitoring, analysis, and prediction of POIs, thereby providing more accurate recommendations and services.

Due to the widespread use of mobile positioning devices (smartphones) and the rapid development of Location-Based Social Networks (LBSNs) [5], users can share location data by checking in at POIs on social networks [6]. Through LBSNs, users can record and share their activities and experiences at different POIs, generating a large number of POI-related data [7]. These POI-related data include users’ interests and behavioral information at specific locations, which can reflect the preferences of user groups. These data can be used to discover human mobility patterns and provide important reference for POI recommendation systems. POI recommendation is an important way to assist users in exploring the surrounding environment to improve user experience. The POI recommendation algorithm provides users with personalized recommendation services by learning user historical data. At present, the mainstream POI recommendation system focuses on learning users’ preferences from historical social network data to make recommendations [8]. However, the results obtained in this way may be counter-intuitive, such as recommending a downtown cafe on a non-weeknight. Therefore, POI recommendation should combine prior knowledge of human mobility patterns [9].

In order to efficiently explore prior knowledge from human mobility patterns, various data in cities (traffic trajectory data, map data, location data, social media data, consumption data, etc.) should be effectively utilized. At present, understanding how to effectively represent the relationship between multi-source heterogeneous data in cities and how to extract and analyze the features of multi-source heterogeneous data relationships are still important and urgent problems to be solved in the research on human mobility patterns. In addition, studies purely aimed at extracting human mobility patterns are rare. Human mobility patterns are usually an intermediate result of completing a specific downstream task. Understanding how to select downstream tasks and verify the quality of the obtained human mobility patterns is also a problem investigated in this paper.

To address these problems, we propose a POI recommendation method based on prior knowledge of human mobility patterns. This method makes full use of the prior knowledge of human mobility patterns to solve the difficult problem of choice caused by a large amount of promotional information flooded in social networks. The contributions of this paper are summarized as follows:

(1): We propose the Human Mobility Pattern Extraction (HMPE) architecture to learn human mobility patterns and extract their representation.
(2): We design a traffic congestion prediction task and conduct experiments on a real taxi dataset from New York, which verifies the feasibility of using an end-to-end architecture to learn human mobility patterns.
(3): We propose a Point of Interest recommendation algorithm called Human Mobility Recommendation (HMRec), which incorporates prior knowledge of human mobility patterns.
(4): We conduct a comparative experiment on the Foursquare dataset, and the experimental results validate the effectiveness of prior knowledge of human mobility patterns.

The rest of this paper is organized as follows. Section 2 introduces the research background and significance of the POI recommendation system and then describes the research status of human mobility pattern discovery and POI recommendation at home and abroad. Section 3 presents the introduction of related theories, focusing on the discovery of human mobility patterns, the POI recommendation algorithm, and related theories and technologies involved. Section 4 provides the experimental design and analysis of results. Section 5 summarizes the specific contributions and implications of this paper and discusses possible future research directions.

2. Related Work

In this section, we introduce the latest progress in POI recommendation and human mobility patterns.

2.1. POI Recommendation

The development of POI recommendation can be attributed to the continuous innovation of data fusion methods, from geographical location information to users’ social relations to the spatiotemporal patterns of cities; more and more key elements have been discovered by researchers as new features of recommendation algorithms [10].

The early POI recommendation algorithm uses the user’s check-in frequency to make recommendations [11]. Berjani et al. [12] used regularized matrix factorization to provide personalized recommendations to users in social networks. However, the raw data values recommended by POI have a very wide range. In addition, the sparsity of the user check-in matrix is much larger than that of the item-rating matrix. For example, the sparsity of the Netflix dataset is 99%, while the sparsity of Gowalla is

2.08 \times 10^{- 4}

. These situations facilitate POI recommendations for data fusion [13]. In the data fusion method, the earliest introduction of POI recommendation is the geographic location information. For POI recommendation systems, the first law of geography is shown as follows: users are more interested in POIs that are geographically closer to them, and they will have higher interest in POIs near the POIs they are interested in. Si et al. [14] proposed an adaptive POI recommendation method combining users’ activities and spatial features, which can operate adaptively according to users’ activities. Liu et al. [15] proposed the Geo-ALM model, which utilizes adversarial learning with geographic information to fuse geographic features and generative adversarial networks. In LBSNs, some POI recommendation works utilized social relationship networks to improve recommendation quality [7,16]. Zhang et al. [17] formed a three-layer model of LBSN with multi-label, social, and geographic influences and finally integrated multi-dimensional information into a matrix factorization framework. Zhang et al. [18] proposed a contextual graph attention model, and Wang et al. [19] used a graph-enhanced spatial-temporal network for POI recommendation.

However, research on POI recommendations is still in its preliminary stage. Although the existing work takes into account the influence of space, social interaction, time, and other factors on the recommendation effect of points of interest, the recommendation target tends to fit the historical check-in behaviors of individual users who are unable to learn the travel rules of people in the city and finally obtains a recommendation result that does not conform to common sense.

2.2. Human Mobility Pattern

In order to efficiently mine human mobility patterns and introduce them into the point of interest recommendation task, all kinds of data in the city should be effectively utilized [20,21]. Therefore, we need to solve two problems: urban multivariate data fusion and human mobility pattern mining.

For urban multivariate data fusion, H. Zhang et al. [22] used taxi data, shared bike data, and road network data to detect abnormal areas in stages. They used different data at different stages of the task process to achieve the purpose of data fusion. X. Liu et al. [23] proposed a deep convolutional auto-encoder architecture that enables the encoding layer to learn the features of different data. Some work [24,25,26] used the idea of collaborative filtering to establish the relationship between different data sources for data fusion. The fusion of multivariate data requires the algorithm to have strong scalability, but there is no general method to meet the needs.

The human mobility pattern itself is also a spatio-temporal pattern, and spatio-temporal pattern mining directly affects the quality of POI recommendations. In order to adapt to different spatio-temporal networks, Li et al. [27] proposed an efficient spatio-temporal neural structure search method to try to automatically construct a general neural network suitable for different spatio-temporal prediction tasks in cities. Due to the lack of semantics in the spatio-temporal data features generated using deep neural networks, Wang et al. [28] proposed a new deep learning framework based on particle swarm optimization. It can be seen that the research on human mobility patterns is often related to people’s travel behaviors and trajectories, and the pros and cons of human mobility pattern mining results often require downstream tasks for verification and support.

Compared to the above works, we propose the architecture of human mobility pattern extraction and then design the end-to-end architecture to learn the characteristics of human mobility patterns, and finally apply the prior knowledge of human mobility patterns to POI recommendations.

3. Methodology

3.1. Human Mobility Pattern Discovery

3.1.1. Spatio-Temporal Graph

Inspired by the widely used data structure of graphs, the spatio-temporal graph data structure proposed in this paper can more accurately represent the spatio-temporal properties of cities [29]. As long as the data can be structured using graphs, spatio-temporal graphs can generally be adapted [30]. We formulate the spatio-temporal graph as

G_{s e q} = {G_{0}, G_{1}, G_{2}, \dots}

, and it can be seen that it consists of a sequence of spatial graphs aggregated by multiple time slices. The spatial graph is defined as

G_{t} (V_{t}, E_{t}, A_{t})

, where

V_{t}

represents the set of all nodes in the time slice,

E_{t}

represents the set of edges in the time slice, and

A_{t}

represents the attribute set of the nodes in the time slice. We adopt spatiotemporal graphs to structurally represent the human mobility pattern of cities and the functional properties of urban areas.

3.1.2. Human Mobility Pattern Discovery Architecture

The human mobility patterns discovery framework uses the prediction of downstream tasks to pre-train the human-mobility-pattern-extraction module, so that the human-mobility-pattern-extraction module obtains the ability to represent the spatio-temporal pattern under the specific city data distribution. At the same time, the prediction results of the downstream tasks themselves can help to test the representation effect of human mobility patterns. The overall architecture is shown in Figure 1. The human mobility pattern-extraction framework has three main parts, the human-mobility-pattern-extraction module, the encoding and decoding module based on the multi-head attention mechanism, and the upsampling module adapted to the downstream task. The three modules form a unified end-to-end model, and the resulting intermediate hidden layer vectors contain the spatio-temporal contextual relations of the human mobility pattern.

The human-mobility-pattern-extraction module is composed of a graph convolutional neural network with shared weights and a feedforward neural network [19]. Given a graph

G (V, E)

, where V and E are the sets of points and edges of the graph, respectively, with the point set size

n = | V |

and the edge set size

m = | E |

. The adjacency matrix of graph G is

A \in R^{n \times n}

. Assuming that there are connected edges between nodes i and j, then

A_{i j} = 1

; otherwise

A_{i j} = 0

. The degree matrix of graph G can be obtained as

D = d i a g (d_{1}, d_{2}, \dots, d_{n})

, where

d_{i}

is the degree of node i in graph G. The Laplacian matrix of the graph is

L = D - A

. We perform an eigenvalue decomposition of

L

to obtain

L = U Λ U^{T}

. Defining the graph signal

x = {[x_{1}, x_{2}, \dots, x_{n}]}^{T}

, for any graph signal x, we can obtain the following formula:

\begin{matrix} x^{T} Lx & = x^{T} Dx - x^{T} Ax \\ = \sum_{i} d_{i} x_{i}^{2} - \sum_{(i, j) \in E} A_{i j} x_{i} x_{j} \\ = \sum_{(i, j) \in E} {(x_{i} - x_{j})}^{2} \end{matrix}

(1)

So far, we can derive the corresponding inverse transform

x = U \hat{x}

of the Fourier transform

\hat{x} = U^{T} x

of the graph signal. The convolution form of the graph signal x and the convolution kernel h on graph G can be obtained, as shown in Formula (2).

x * h = U \cdot d i a g (\hat{h}) \cdot U^{T} x

(2)

We replace

d i a g (\hat{h})

above with

\sum_{k = 0}^{K} α_{k} Λ^{k}

, where

{α_{k}}_{k = 0}^{K}

is a learnable parameter. So far, we have obtained a graph convolutional neural network model that does not require feature decomposition and has a computational complexity of

O (n)

:

\begin{matrix} y & = σ (U \cdot (\sum_{k = 0}^{K} α_{k} Λ^{k}) \cdot U^{T} x) \\ = σ (\sum_{k = 0}^{K} α_{k} L^{k} x) \end{matrix}

(3)

In this module, the graph signal refers to the functional attribute distribution vector of the urban area, and the adjacency matrix is the spatio-temporal graph of the flow between urban areas. We set the matrix after rasterization of the city to have

r \times r

areas, the attribute vector of each area to have

d_{f i}

dimensions, and the area representation dimension after graph convolution to be

d_{f o}

. The graph neural network framework used in this module can be simplified as follows:

Y = σ ({\hat{D}}^{- 1} \hat{A} XW)

(4)

where

X \in R^{r^{2} \times d_{f i}}

is the city function attribute matrix,

A \in R^{r^{2} \times r^{2}}

is the traffic adjacency matrix,

\hat{A} = A + I

,

\hat{D}

is the degree matrix corresponding to

\hat{A}

, and

W \in R^{d_{f i} \times d_{f o}}

is the network parameter to be learned.

XW

linearly changes the feature vector of the node,

\hat{A} XW

propagates the transformed node feature to neighbors, and

{\hat{D}}^{- 1} \hat{A} XW

normalizes the feature received by the node.

In this paper, the graph convolutional neural network [32,33] to which each time slice belongs shares the parameter weight, and only one graph neural network is trained and maintained. For the input backpropagation gradients of different time slices, we use the sum-average processing. The human mobility pattern extraction strategy learned by the graph neural network is applicable to all time slices [34].

Since the input data of the coding module are a sequence composed of the representation vectors of urban human mobility patterns under a single time slice instead of the whole graph representation, we use the feedforward neural network [35] module to realize the function of the ReadOut module in DGI [36] and obtain the hidden vector representation of the whole graph.

In the encoder–decoder module based on the multi-head attention mechanism, we refer to the encoder–decoder structure of Transformer [37]. We use a sequence of representation vectors of human mobility patterns as input representations and target data representations of downstream tasks as output representations. The attention mechanism can be represented by the following formula:

A t t e n t i o n (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}}) V

(5)

We set the original attention module to be replicated h times, i.e., h head attention. We input the same data to the multi-head attention module to obtain h single-head attention results. We then spliced the multi-head results and transformed them through a linear layer to obtain the fused feature output [38]. The multi-head attention formula is as follows:

M u l t i H e a d (Q, K, V) = C o n c a t (h e a d_{1}, h e a d_{2}, \dots, h e a d_{h}) W^{o}

(6)

We used the human-mobility-pattern-extraction module to replace the original list of embedding in the transformer. The representation is directly extracted from the original data, and the output module is replaced by the downstream-task feature-extraction module. The upsampling module is used to restore the predicted representation vector to the same structure as the real data. In this paper, we use the deconvolution network as the upsampling module and generate a map of the urban traffic congestion situation from the predicted representation vector

e \hat{m} b \in R^{d}

, denoted as

\hat{S} \in R^{r \times r}

. We calculate the average speed of

r \times r

nodes in the time period, and the average speed of each node in each time slice can be regarded as a map snapshot of urban traffic congestion. We denote the snapshot of the congested state of the city-market traffic corresponding to the time slice t as

S_{t} \in R^{r \times r}

. We thereby formulate the task of urban traffic congestion state assessment. Given the spatio-temporal graph sequence of traffic flow as

G_{s e q} = {G_{1}, G_{2}, \dots, G_{t}}

, the corresponding function distribution of each region as

A \in R^{r^{2} \times d_{f i}}

, the sequence of urban traffic congestion state as

A \in R^{r^{2} \times d_{f i}}

, and the prediction of urban traffic congestion state in time slice t as

S_{t}

, let the output result of the overall framework be

{\hat{S}}_{t}

, and expect

{\hat{S}}_{t}

to be as close as possible to

S_{t}

. Then, the optimization objective of the overall framework is

L = ∥ {\hat{S}}_{t} - S_{t} ∥

(7)

In order to adapt to the data characteristics, we used the convolutional neural network in the downstream task feature extraction module of this paper and the deconvolutional neural network in the upsampling module.

3.2. POI Recommendation Algorithm Based on Human Mobility Patterns

In a typical location-based social network (LBSNs), the POI recommendation system has a set of N users

U = {u_{1}, u_{2}, \dots, u_{N}}

and a set of M geographic locations

L = {l_{1}, l_{2}, \dots, l_{N}}

, also known as the set of interest points. The set of interest points accessed by user u is represented by

L_{u}

. Each location

l_{i}

is geocoded with <

l o n g i t u d e, l a t i t u d e

>. Generally, we convert the user’s check-in information into the user-POI check-in frequency matrix

C

. Each entry

c_{u i}

of

C

represents the check-in frequency of user u at place i, and the check-in frequency reflects the user’s preference for different points of interest. Typically, the user visits only a small number of locations, so the matrix

C

is very sparse. We retain the weight of the human mobility pattern extractor model and separate it out separately. The result is used as the part of urban human mobility feature extraction of the recommendation algorithm [39]. The HMRec algorithm borrows from the Wide&Deep model [40], so they have similar time complexity, approximately

O (d + L n)

, where d represents the dimensionality of the input features, L is the number of layers in the model, and n is the number of neurons per layer. The specific structure of the model is shown in Figure 2.

The HMRec model utilizes multiple cross layers for feature crossing [41]. If the output vector of the l-th Cross layer is

x_{l}

, then the output of the

l + 1

-th layer is

x_{l + 1} = x_{0} x_{l}^{T} W_{l} + b_{l} + x_{l}

(8)

The second-order operation of the cross layer is similar to the outer product operation in the PNN model, on top of which we add the weights

w_{l}

of the outer product operation, as well as the input

x_{l}

and bias

b_{l}

. It can be seen that the cumulative effect of the superposition of the Cross layers on the parameter quantity is relatively slow, and each layer only adds an n-dimensional weight vector

w_{l}

, where n is the dimension of the input vector. In addition, the original input vector

x_{l}

is preserved at each layer so that the variation between input and output is small. We retain the weight of the human mobility pattern extractor model and separate it out separately, and it is used as part of the urban human mobility feature extraction of the recommendation algorithm [42]. The overall algorithm flow is shown in Algorithm 1.

Algorithm 1 HMRec Algorithm

Input: Features of urban human mobility pattern

f_{h}

, point of interest

I D_{i}

, user

I D_{j}

, dense features

f_{d}^{1}, f_{d}^{2}, \dots, f_{d}^{m}

, discrete features

f_{s}^{1}, f_{s}^{2}, \dots, f_{s}^{n}

Output: The predicted score p of user j checking in at POI i
Part I: Cross Layer
1: transform the discrete features into the dense features:

f_{d}^{m + k} = E m b e d d i n g (f_{s}^{k})

2: splice dense features:

f_{d} = C a t (f_{d}^{1}, f_{d}^{2}, \dots, f_{d}^{m + n}, f_{h})

3: using deep networks to fuse dense features:

h_{0} = R e L U (W_{0} f_{d} + b_{0})

4: get dense features:

h_{l + 1} = R e L U (W_{l} h_{l} + b_{l})

Part II: Deep Layer
5: discrete feature multi-layer high-order intersection:

x_{l + 1} = x_{0} x_{l}^{T} W_{l} + b_{l} + x_{l}

6: connect the Deep network and cross network and output:

x_{c} = C a t (h, x)

7: return return predicted score

p = s i g m o i d (W x_{c} + b)

4. Results and Discussion

4.1. Urban Traffic Congestion Status Assessment Task

In this paper, in order to ensure that the spatio-temporal pattern extraction framework can learn urban spatio-temporal features and at the same time evaluate the learning effect of this feature, we design urban congestion situation prediction as a downstream task of the extraction framework. This is also a general practice for evaluating the performance of vector representations in the hidden middle layers of an encoding–decoding framework [43].

We used yellow taxi data and POI data from May and June 2012 in New York City to conduct a comparative experiment. The experiment selected the spatial range of longitude from −74.0108 to −73.9600 and latitude from 40.7333 to 40.7700 in New York City. Within the time and space selected for the experiment, the number of taxi trajectory data was about 9 million, the number of point-of-interest data was 1612, and there were 18 types of points of interest.

In this study, we designed ablation experiments to verify the effectiveness of the HMPE algorithm. In this paper, a method similar to unsupervised learning verification was adopted to evaluate the learning effect of the model on target hidden vectors using the performance results of the model in downstream tasks. The HMPE algorithm framework proposed in this paper mainly needs to verify two parts: the human mobility pattern extraction module and the timing feature extraction module. Therefore, the control part of the ablation experiment was set as the representative algorithm of each part.

For the human mobility pattern extraction module, there are two parts in the framework; one is the graph representation algorithm with attributes, and the other is ReadOut. Consider the Node2Vec algorithm [44] (Scalable Network Feature Learning) as a representative of a class of algorithms for attribute-free, purely structured representations. We used ANRL [45] (Deep-Neural-Network-Based Attribute Network Representation Learning) to compare the differences between attribute-based graph-convolution algorithms and attribute-based random–walk algorithms. In the ReadOut section, we used a convolutional neural network to verify the difference between city-wide global view (MLP) and local view (CNN [46]) in urban spatio-temporal scenes. For the time series feature-extraction module, the GRU [47] in the long-term and short-term memory network was used as the representative algorithm to compare the difference between learning only a single long-term and short-term memory vector and using the attention mechanism to treat the features of each time period differently in the urban spatio-temporal scene [48,49].

In the experiment, we set 200 epochs, the test set ratio was set to 0.3, the training batch size was set to four, the side length of the city raster division was set to 16, and the time series length was set to 24. In addition, in order to ensure the singleness of the ablation experiment variables, we limited the size of the output vector of the human mobility feature module to 32 and set the number of channels to eight. The computational focus of the experiment lies primarily in the encoding and decoding modules of the multi-head attention mechanism. The architecture of this module is inspired by the encoding and decoding structure of the transformer, so the time complexity is similar. The time complexity of the transformer model primarily depends on the computational complexity of its self-attention mechanism and feed-forward neural network. The time complexity per layer is approximately

O (n^{2} d)

, where n represents the sequence length and d represents the representation dimension. We use Mean Absolute Percentage Error (MAPE) and Rooted Mean Square Error (RMSE) as the evaluation indicators of the framing effect.

The experimental results are shown in Table 1, and it can be seen that the performance of the HMPE framework proposed in this paper is better than other baseline models in terms of RMSE and MAPE. This shows that the HMPE framework performs as expected, effectively extracting human mobility patterns in cities. Compared with the structure using Node2Vec, HMPE makes better use of the features of urban functional areas. Compared with using CNN as the structure of ReadOut, HMPE uses MLP to achieve a global receptive field instead of a local receptive field and achieves better results. Since the two structures of Node2Vec and ANRL are staged models, the results obtained by the end-to-end training of HMPE are obviously superior. Note that the performance of GCN-MLP-GRU is also quite good, which to some extent shows that a good representation of urban spatiotemporal features is very helpful for downstream tasks.

4.2. POI Recommendation Based on Human Mobility Pattern

We selected the travel data and check-in data of New York City in May and June 2012 for the experiment. The user check-in data came from the Foursquare dataset publicly available on Kaggle, which contains 227,428 user check-in data. We aggregated the information on the check-in time according to the hour, and obtained 1464 time slices of the check-in data. Arranging the check-in data in chronological order, we took the first 70% of the data as the training set and removed the points of interest that users have checked in to from the remaining 30% data to obtain the test set.

Comparative experimental baseline models include the deep factorization machine (DeepFM) [50], the representational multi-layer perceptron (EmbeddingMLP), the deep-learning-based collaborative filtering model (NeuralCF) [51], the Wide&Deep model, and the Twin-tower model [52]. In the comparison experiment on the POI recommendation model, this paper used the accuracy rate (ACC) and two AUC values (ROC and PR) as the experimental indicators.

The experimental results are shown in Table 2. It can be seen that the HMRec algorithm proposed in this paper achieved results that exceed all baseline models in terms of accuracy and receiver operating characteristic curve (ROC), which proves that the urban spatio-temporal attribute of human mobility patterns plays a certain role in POI recommendation. It is worth noting that the performance of HMRec on the precision-recall curve (PR) is slightly lower than that of the DeepFM algorithm because the DeepFM model does not incorporate prior information on human mobility patterns and thus has a higher degree of fitting to positive samples. Among all the baseline models, the performance of the Twin-tower model is relatively poor, which may be due to the fact that in the case of sparse samples, using a deep network for both users and points of interest can easily produce over-fitting results, resulting in the test set. The generalization ability is relatively poor. In contrast, NeuralCF achieves better results than the Twin-tower model due to its simplicity.

By visualizing the ROC and PR curves (as shown in Figure 3), it can be seen that HMRec, Wide&Deep, and DeepFM are relatively close in terms of ROC curves. The curve of EmbeddingMLP has a steeper change in the middle, indicating that it does not perform well when predicting scores around 0.5. From the PR curve, it can be seen that the performance of HMRec, Wide&Deep, and DeepFM is still relatively stable, and Wide&Deep is not as good as HMRec and DeepFM after the recall rate increases. The performance of NeuralCF is very unstable, and the curve has large fluctuations. We can see that the HMRec represented by the red line achieves the best results.

However, it must be acknowledged that our proposed method may have some drawbacks. The acquisition and analysis of crowd movement patterns may raise concerns regarding personal privacy and data protection. Although crowd movement patterns can provide useful information about user behavior, the sparsity of data may impact the effectiveness of the model as users typically visit only a small fraction of the overall set of locations. This can result in inaccurate predictions for certain locations or user behaviors.

5. Conclusions

At present, the existing research work on POI recommendation algorithms focuses on the mining of the interaction history between users and POIs and the research on users’ social relations, ignoring the urban scene where POI recommendation is located and the objective existence of the prior knowledge of human mobility patterns. To this end, this paper proposes a human mobility pattern extraction framework, HMPE, and designs an end-to-end pre-training process using graph neural networks and attention mechanisms. This paper proposes the HMRec POI recommendation algorithm to incorporate prior knowledge of human mobility patterns into POI recommendation. Experiments verify the effectiveness of human mobility patterns in POI recommendation.

In the future, we will attempt to mitigate the issue of data sparsity by employing data imputation techniques, and we will explore the design of multiple related downstream tasks to simultaneously obtain spatio-temporal prior knowledge from multiple tasks, aiming to comprehensively improve the effectiveness of POI recommendation.

Author Contributions

Conceptualization and methodology, J.S.; data curation and formal analysis, J.S. and Q.Y.; experiments and analysis, J.S., Q.Y., H.G. and B.W.; investigation, H.G.; validation and visualization, Q.Y., H.G. and B.W.; writing—original draft preparation, J.S., Q.Y. and H.G.; writing—review and editing, X.K. and Q.Y.; resources and supervision, X.K.; funding acquisition, X.K. and B.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the National Natural Science Foundation of China (62072409), Zhejiang Provincial Natural Science Foundation (LR21F020003), Program, for improving the Scientific Reasearch Ability of Youth Teachers of Inner Mongolia Agricultural University (RZ2200001860) and Major Science and Technology Projects of Inner Mongolia Autonomous Region (2020ZD0004).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhang, Y.; Sotiriadis, M.; Shen, S. Investigating the Impact of Smart Tourism Technologies on Tourists’ Experiences. Sustainability 2022, 14, 3048. [Google Scholar] [CrossRef]
Kong, X.; Huang, Z.; Shen, G.; Lin, H.; Lv, M. Urban Overtourism Detection Based on Graph Temporal Convolutional Networks. IEEE Trans. Comput. Soc. Syst. 2022. early access. [Google Scholar] [CrossRef]
Wang, W.; Kumar, N.; Chen, J.; Gong, Z.; Kong, X.; Wei, W.; Gao, H. Realizing the potential of the internet of things for smart tourism with 5G and AI. IEEE Netw. 2020, 34, 295–301. [Google Scholar] [CrossRef]
Kong, X.; Wu, Y.; Wang, H.; Xia, F. Edge Computing for Internet of Everything: A Survey. IEEE Internet Things J. 2022, 9, 23472–23485. [Google Scholar] [CrossRef]
Wu, X.; Fu, L.; Wang, S.; Jiang, B.; Wang, X.; Chen, G. Collective Influence Maximization in Mobile Social Networks. IEEE Trans. Mob. Comput. 2021, 22, 797–812. [Google Scholar] [CrossRef]
Li, D.; Gong, Z. A deep neural network for crossing-city poi recommendations. IEEE Trans. Knowl. Data Eng. 2020, 34, 3536–3548. [Google Scholar] [CrossRef]
Fang, J.; Meng, X.; Qi, X. A top-k POI recommendation approach based on LBSN and multi-graph fusion. Neurocomputing 2023, 518, 219–230. [Google Scholar] [CrossRef]
Wang, X.; Qin, J.; Deng, S.; Zeng, W. Knowledge-Aware Enhanced Network Combining Neighborhood Information for Recommendations. Appl. Sci. 2023, 13, 4577. [Google Scholar] [CrossRef]
Gao, Q.; Wang, W.; Huang, L.; Yang, X.; Li, T.; Fujita, H. Dual-grained human mobility learning for location-aware trip recommendation with spatial–temporal graph knowledge fusion. Inf. Fusion 2023, 92, 46–63. [Google Scholar] [CrossRef]
Liu, Y.; Yang, Z.; Li, T.; Wu, D. A novel POI recommendation model based on joint spatiotemporal effects and four-way interaction. Appl. Intell. 2022, 52, 5310–5324. [Google Scholar] [CrossRef]
Zhang, J.; Shi, W.; Xiu, C. Application of POI data in Chinese urban research. Geogr. Sci. 2021, 41, 140–148. [Google Scholar]
Berjani, B.; Strufe, T. A recommendation system for spots in location-based online social networks. In Proceedings of the 4th Workshop on Social Network Systems, Salzburg, Austria, 10 April 2011; pp. 1–6. [Google Scholar]
Yu, Y.; Chen, X. A survey of point-of-interest recommendation in location-based social networks. In Proceedings of the Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015. [Google Scholar]
Si, Y.; Zhang, F.; Liu, W. An adaptive point-of-interest recommendation method for location-based social networks based on user activity and spatial features. Knowl.-Based Syst. 2019, 163, 267–282. [Google Scholar] [CrossRef]
Liu, W.; Wang, Z.J.; Yao, B.; Yin, J. Geo-ALM: POI Recommendation by Fusing Geographical Information and Adversarial Learning Mechanism. In Proceedings of the IJCAI, Macao, China, 10–16 August 2019; Volume 7, pp. 1807–1813. [Google Scholar]
Lu, L.U.; Zhu, F.; Gao, R.; Zhu, L. Point of interest joint recommendation method based on user-content topic model. Comput. Eng. Appl. 2018, 54, 154–159. [Google Scholar]
Zhang, Z.; Liu, Y.; Zhang, Z.; Shen, B. Fused matrix factorization with multi-tag, social and geographical influences for POI recommendation. World Wide Web 2019, 22, 1135–1150. [Google Scholar] [CrossRef]
Zhang, S.; Cheng, H. Exploiting context graph attention for poi recommendation in location-based social networks. In Lecture Notes in Computer Science, Database Systems for Advanced Applications, Proceedings of the 23rd International Conference, DASFAA 2018, Gold Coast, QLD, Australia, 21–24 May 2018; Springer: Cham, Switzerland, 2018; pp. 83–99. [Google Scholar]
Wang, Z.; Zhu, Y.; Zhang, Q.; Liu, H.; Wang, C.; Liu, T. Graph-enhanced spatial-temporal network for next POI recommendation. ACM Trans. Knowl. Discov. Data 2022, 16, 1–21. [Google Scholar] [CrossRef]
Ning, Z.; Yang, Y.; Wang, X.; Guo, L.; Gao, X.; Guo, S.; Wang, G. Dynamic computation offloading and server deployment for UAV-enabled multi-access edge computing. IEEE Trans. Mob. Comput. 2021, 22, 2628–2644. [Google Scholar] [CrossRef]
Wang, X.; Ning, Z.; Guo, S.; Wen, M.; Poor, H.V. Minimizing the age-of-critical-information: An imitation learning-based scheduling approach under partial observations. IEEE Trans. Mob. Comput. 2021, 21, 3225–3238. [Google Scholar] [CrossRef]
Zhang, H.; Zheng, Y.; Yu, Y. Detecting urban anomalies using multiple spatio-temporal data sources. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2018, 2, 1–18. [Google Scholar] [CrossRef]
Liu, X.; Wang, M.; Zha, Z.J.; Hong, R. Cross-modality feature learning via convolutional autoencoder. ACM Trans. Multimed. Comput. Commun. Appl. 2019, 15, 1–20. [Google Scholar] [CrossRef]
Das, S.; Dutta, S. A Fusion Approach for Collaborative Filtering. In Proceedings of the 2019 3rd International Conference on Innovation in Artificial Intelligence, Medan, Indonesia, 19–20 July 2019; pp. 263–269. [Google Scholar]
Long, J.; Chen, T.; Nguyen, Q.V.H.; Yin, H. Decentralized collaborative learning framework for next POI recommendation. ACM Trans. Inf. Syst. 2023, 41, 1–25. [Google Scholar] [CrossRef]
Cai, Z.; Yuan, G.; Qiao, S.; Qu, S.; Zhang, Y.; Bing, R. FG-CF: Friends-aware graph collaborative filtering for POI recommendation. Neurocomputing 2022, 488, 107–119. [Google Scholar] [CrossRef]
Li, T.; Zhang, J.; Bao, K.; Liang, Y.; Li, Y.; Zheng, Y. Autost: Efficient neural architecture search for spatio-temporal prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Virtual Event, 6–10 July 2020; pp. 794–802. [Google Scholar]
Wang, D.; Liu, K.; Mohaisen, D.; Wang, P.; Lu, C.T.; Fu, Y. Towards semantically-rich spatial network representation learning via automated feature topic pairing. Front. Big Data 2021, 4, 89. [Google Scholar] [CrossRef]
Lu, B.; Gan, X.; Jin, H.; Fu, L.; Zhang, H. Spatiotemporal adaptive gated graph convolution network for urban traffic flow forecasting. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, Virtual Event, 19–23 October 2020; pp. 1025–1034. [Google Scholar]
Kong, X.; Chen, Q.; Hou, M.; Rahim, A.; Ma, K.; Xia, F. RMGen: A tri-layer vehicular trajectory data generation model exploring urban region division and mobility pattern. IEEE Trans. Veh. Technol. 2022, 71, 9225–9238. [Google Scholar] [CrossRef]
Yang, S.; Liu, J.; Zhao, K. GETNext: Trajectory flow map enhanced transformer for next POI recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, 11–15 July 2022; pp. 1144–1153. [Google Scholar]
Shen, G.; Han, X.; Chin, K.; Kong, X. An attention-based digraph convolution network enabled framework for congestion recognition in three-dimensional road networks. IEEE Trans. Intell. Transp. Syst. 2021, 23, 14413–14426. [Google Scholar] [CrossRef]
Lu, B.; Gan, X.; Jin, H.; Fu, L.; Wang, X.; Zhang, H. Make more connections: Urban traffic flow forecasting with spatiotemporal adaptive gated graph convolution network. ACM Trans. Intell. Syst. Technol. 2022, 13, 1–25. [Google Scholar] [CrossRef]
Ning, Z.; Sun, S.; Wang, X.; Guo, L.; Guo, S.; Hu, X.; Hu, B.; Kwok, R.Y. Blockchain-enabled intelligent transportation systems: A distributed crowdsensing framework. IEEE Trans. Mob. Comput. 2021, 21, 4201–4217. [Google Scholar] [CrossRef]
Kong, X.; Gao, H.; Shen, G.; Duan, G.; Das, S.K. Fedvcp: A federated-learning-based cooperative positioning scheme for social internet of vehicles. IEEE Trans. Comput. Soc. Syst. 2021, 9, 197–206. [Google Scholar] [CrossRef]
Velickovic, P.; Fedus, W.; Hamilton, W.L.; Liò, P.; Bengio, Y.; Hjelm, R.D. Deep graph infomax. ICLR 2019, 2, 4. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
Kong, X.; Wang, K.; Hou, M.; Hao, X.; Shen, G.; Chen, X.; Xia, F. A federated learning-based license plate recognition scheme for 5G-enabled internet of vehicles. IEEE Trans. Ind. Inform. 2021, 17, 8523–8530. [Google Scholar] [CrossRef]
Chen, L.; Yuan, F.; Yang, J.; He, X.; Li, C.; Yang, M. User-specific Adaptive Fine-tuning for Cross-domain Recommendations. IEEE Trans. Knowl. Data Eng. 2021, 35, 3239–3252. [Google Scholar] [CrossRef]
Cheng, H.T.; Koc, L.; Harmsen, J.; Shaked, T.; Chandra, T.; Aradhye, H.; Anderson, G.; Corrado, G.; Chai, W.; Ispir, M.; et al. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, Boston, MA, USA, 15 September 2016; pp. 7–10. [Google Scholar]
Wang, R.; Fu, B.; Fu, G.; Wang, M. Deep & cross network for ad click predictions. In Proceedings of the ADKDD’17, Halifax, NS, Canada, 14 August 2017; pp. 1–7. [Google Scholar]
Kong, X.; Duan, G.; Hou, M.; Shen, G.; Wang, H.; Yan, X.; Collotta, M. Deep reinforcement learning-based energy-efficient edge computing for internet of vehicles. IEEE Trans. Ind. Inform. 2022, 18, 6308–6316. [Google Scholar] [CrossRef]
Wang, X.; Ning, Z.; Guo, S.; Wen, M.; Guo, L.; Poor, V. Dynamic UAV deployment for differentiated services: A multi-agent imitation learning based approach. IEEE Trans. Mob. Comput. 2021, 22, 2131–2146. [Google Scholar] [CrossRef]
Grover, A.; Leskovec, J. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 855–864. [Google Scholar]
Zhang, Z.; Yang, H.; Bu, J.; Zhou, S.; Yu, P.; Zhang, J.; Ester, M.; Wang, C. ANRL: Attributed network representation learning via deep neural networks. In Proceedings of the IJCAI, Stockholm, Sweden, 13–19 July 2018; Volume 18, pp. 3155–3161. [Google Scholar]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
Cho, K.; Van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv 2014, arXiv:1406.1078. [Google Scholar]
Wang, X.; Ning, Z.; Guo, L.; Guo, S.; Gao, X.; Wang, G. Mean-Field Learning for Edge Computing in Mobile Blockchain Networks. IEEE Trans. Mob. Comput. 2022. early access. [Google Scholar] [CrossRef]
Yang, M.; Qu, Q.; Shen, Y.; Zhao, Z.; Chen, X.; Li, C. An Effective Hybrid Learning Model for Real-Time Event Summarization. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4419–4431. [Google Scholar] [CrossRef]
Guo, H.; Tang, R.; Ye, Y.; Li, Z.; He, X. DeepFM: A factorization-machine based neural network for CTR prediction. arXiv 2017, arXiv:1703.04247. [Google Scholar]
He, X.; Liao, L.; Zhang, H.; Nie, L.; Hu, X.; Chua, T.S. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web, Perth, Australia, 3–7 April 2017; pp. 173–182. [Google Scholar]
Huang, P.S.; He, X.; Gao, J.; Deng, L.; Acero, A.; Heck, L. Learning deep structured semantic models for web search using clickthrough data. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, San Francisco, CA, USA, 27 October–1 November 2013; pp. 2333–2338. [Google Scholar]

Figure 1. Human-mobility-pattern-discovery architecture. The human mobility pattern extraction module consists of a convolutional neural network with a shared weight graph and a feedforward neural network. The encoding and decoding module based on the multi-head attention mechanism refers to the encoding and decoding structure of Transformer [31]. The upsampling module uses a deconvolutional network.

Figure 2. HMRec algorithm structure. The input of the deep model is a full amount of feature vectors, covering the geographic location features, time features, category features of POIs, and ID class features of users and POIs. The inputs to the Cross model are user ID, POI ID, and POI category.

Figure 3. ROC and PR curve visualization of experimental results of recommendation algorithms. The left figure depicts draws the ROC curve of each model. The right figure depicts the PR curve of each model.

Table 1. Results of urban traffic congestion prediction.

Evaluation Indicator	GCN-CNN-Trans	N2V-MLP-Trans	ANRL-MLP-Trans	GCN-MLP-GRU	HMPE
RMSE	0.1693	0.1517	0.1807	0.1473	0.1466
MAPE	34.70%	28.39%	44.71%	26.06%	25.65%

Table 2. Comparison of experimental results of the POI recommendation model.

Model Algorithm	ACC	AUC (ROC)	AUC (PR)
HMRec	0.8103	0.7331	0.3558
Wide&Deep	0.7978	0.7271	0.3499
DeepFM	0.8095	0.7253	0.3610
NeuralCF	0.7810	0.7078	0.3097
Twin Towers	0.6922	0.6097	0.1781
EmbeddingMLP	0.7966	0.7103	0.3302

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Song, J.; Yi, Q.; Gao, H.; Wang, B.; Kong, X. Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation. Appl. Sci. 2023, 13, 6495. https://doi.org/10.3390/app13116495

AMA Style

Song J, Yi Q, Gao H, Wang B, Kong X. Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation. Applied Sciences. 2023; 13(11):6495. https://doi.org/10.3390/app13116495

Chicago/Turabian Style

Song, Jingbo, Qiuhua Yi, Haoran Gao, Buyu Wang, and Xiangjie Kong. 2023. "Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation" Applied Sciences 13, no. 11: 6495. https://doi.org/10.3390/app13116495

APA Style

Song, J., Yi, Q., Gao, H., Wang, B., & Kong, X. (2023). Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation. Applied Sciences, 13(11), 6495. https://doi.org/10.3390/app13116495

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Exploring Prior Knowledge from Human Mobility Patterns for POI Recommendation

Abstract

1. Introduction

2. Related Work

2.1. POI Recommendation

2.2. Human Mobility Pattern

3. Methodology

3.1. Human Mobility Pattern Discovery

3.1.1. Spatio-Temporal Graph

3.1.2. Human Mobility Pattern Discovery Architecture

3.2. POI Recommendation Algorithm Based on Human Mobility Patterns

4. Results and Discussion

4.1. Urban Traffic Congestion Status Assessment Task

4.2. POI Recommendation Based on Human Mobility Pattern

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI