You are currently viewing a new version of our website. To view the old version click .
Sustainability
  • Article
  • Open Access

4 October 2019

Understanding Urban Mobility Pattern with Cellular Phone Data: A Case Study of Residents and Travelers in Nanjing

,
,
,
and
1
Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
2
Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing 211189, China
3
School of Transportation, Southeast University, Nanjing 211189, China
4
School of Highway, Chang’an University, Xi’an 710064, China

Abstract

The rapid development of urban metropolises has attracted a growing number of immigrants and travelers, increasing the burden on transportation systems. Previous research on urban mobility patterns have ignored the temporal variations and heterogeneity in divergent urban trip makers due to the limited data resolution and coverage. In this paper, we analyzed cellular phone data of more than five million travelers for one month in Nanjing, China and proposed a method to extract trip origin and destination information from cellular phone signal data. We found that mobility patterns are different for urban residents, short-term travelers, and transfer travelers, and that trip length distributions can best be described by gamma and exponential distributions. In addition to the daily trip length distribution models, we utilized the agglomerative hieratical clustering method in order to group similar hourly trip patterns and further proposed within-day trip length distribution models under different times of the day and days of the week.

1. Introduction

Understanding urban mobility patterns is essential in transportation engineering, such as the estimation of trip distributions in the transportation planning procedures, and traffic forecasting in traffic management. Traditional urban mobility information acquisition relies on urban residents’ travel surveys. In traditional travel surveys, methods such as questionnaires and telephone inquiries are used to obtain travel information such as travel destinations, travel time, and travel modes. The process takes a lot of manpower and the cost is expensive. Moreover, the accuracy of the self-reported travel times and distances has raised serious questions. The participants of the surveys are mostly residents of the city and travelers are often neglected from those kinds of surveys. However, these travelers contribute a huge number of trips, especially in large metropolitan areas.
With the rapid development of telecommunication technology, new pervasive data collection methods such as GPS and cellular phone technology have appeared to provide auxiliary traffic information sources with a larger sample size at a lower cost. Although GPS techniques could provide more accurate positioning results for individual users, vendors such as navigation providers have difficulty collecting a sufficiently large sample size compared with cellular phone service providers. Users may not use a navigation service for every trip, especially for their routine commuting trips, however cellular phone services can passively collect people’s trajectories as long as the users turn their phone on. With cellular phone data, we can extract the trip information from a large population size, including both residents and travelers. Currently, the difficulty of utilizing cellular phone data lies in its huge volume, and more efficient procedures are needed for extracting useful information from these large data sets.
This paper contributes to this effort by identifying different mobility patterns in a heterogeneous population group, including residents, short-term travelers, and transfer travelers using cellular phone data. The methodology developed for identifying trip ends using cellular phone data is proposed, followed by the results of a statistical analysis to model trip-length distributions for heterogeneous travelers at various time periods. These results might be useful for estimating origin–destination information for transportation planners.

3. Data and Methods

In this research, we have collected mobile phone signaling records from the largest cellular service provider in the city of Nanjing during October, 2017. Figure 1 displays the spatial distribution of the base stations in Nanjing. The city is divided into 38 traffic analysis zones by the Transportation Planning Agency. As indicated in Figure 1, the base stations are more densely distributed in the central area of the city than in the suburban area, which is consistent with the population distribution of Nanjing.
Figure 1. Spatial Distribution of the Base Stations in Nanjing. (a) Research Area. (b) Base Stations. (c) Base Station Count by Zone.
Figure 2 illustrates the methodology framework of the study. Firstly, the cellular phone data was sorted in ordered sequences and then cleansed to eliminate the noise data, including ping-pong and drifting data. Subsequently, trip information was extracted from those signal sequences. In order to examine the heterogeneity in the behavior of different trip makers, we classified the cellphone users into three different groups, namely residents, short-term travelers, and transfer travelers. Then, we tested four different models, including the exponential, Weibull, Gamma, and Rayleigh models in the modeling of daily trip length distribution models and the hourly trip length distribution models for the three groups of cellular phone users. Finally, the R-squared value, Kolmogorov-Smirnov test method, and visual comparison methods were used to evaluate the models.
Figure 2. Methodology Framework of the Study.

3.1. Cellular Phone Data

A complete cellular phone signal record contains multiple fields and we only kept the fields related to location information in order to reduce the volume of the original data. The data structure of the cellular phone signal record is: {MSID, Date_Time, LAC,CellID,EventID}. The definitions of each data field are listed as follows:
MSID: The phone number is encoded to protect users’ personal information. This field is unique for each individual user.
Date_Time: The time when the record is generated.
LAC: (Location Area Code): A location area refers to the area where mobile phones can move freely without reporting their location to the cellular service provider. When a cellphone moves out of a location area, a location update event will be triggered. A location area may cover several base station cells.
CellID: The unique ID for a base station cell, a base station cell refers to the service area covered by a base transceiver station (short for base station in the following context). The LAC and the CellID must be used together to uniquely identify a base station cell. The base station cell in the urban area generally covers less than 300 meters in radius range, and the distance between the two base stations in the suburb area may reach more than one kilometer. The spatial resolution can satisfy the traffic demand analysis requirements.
EventID: There are three types of event that can generate a mobile phone signaling record, including a location update, handover and phone calling.
-
Location update: When a mobile phone moves from one location area to another, a normal location update event will be generated. If the mobile phone generates no location update time for a very long time, a periodic location update event will be triggered to confirm the location of the mobile phone. The cycle length of the periodic location update event is about 1 hour.
-
Handover: When the mobile phone is on call and the user is moving from one base station cell to another, a handover event will be triggered to switch the call channel from the original base station to ensure the call quality.
-
Phone calling and messages: When the users make phone calls or send/receive text messages, a phone call event will be generated.
Figure 3 displays the average hourly volume of signaling records during the research period in Nanjing. The number of handover signals and call/message signals both peak at 9:00–11:00 and 16:00–18:00, which coincides with the normal peak traffic hours. The trends of the curves of the normal location update and the periodic location update appear to be opposite, since periodic location update signals are generated when mobile phones stay in the same location update area and no normal location update signal is triggered for one hour.
Figure 3. Average Hourly Volume of Cellular Phone Signals in Nanjing.

3.2. Identify Trip Origin-Destination Information using Cellular Phone Data

The responsibility of the cellular phone signaling system is to ensure the quality of the phone calling services, not to locate the accurate real-time position of the users. Due to the complexity of the telecommunication system, the original mobile phone positioning data obtained from the cellular service provider have a substantial amount of redundant data and disturbance data, therefore it is necessary to first filter and eliminate these noise data to improve the data quality. We need a procedure to cleanse the signal records and extract the users’ origin–destination. The method involves the following steps:
(1)
Sorting signal records in ordered sequences: The format of the original signal record is: {MSID, Date_Time, LAC, CellID, EventID} Firstly, we need to map the location of the base station using the LAC and CellID in order to obtain the geocoordinates of the base stations, and then sort each user’s signal records by the Date_Time to obtain each user’s trajectory.
(2)
Cleansing ping-pong signals and drifting signals: Ping-pong signals and drifting signals constitute the two major types of noise data in the users’ trajectory. Typically, the nearest base station to a user would take charge of the users’ phone service. When the workload of the nearest base station is large or the signal is weak, users may be allocated to a farther base station, which forms the drifting signal. In the telecommunicating peak hours, many base stations experience excessive workload, and the users may be switched back and forth between two adjacent base stations, which arouses the ping-pong signal.
Firstly, we need to traverse through the users’ trajectories and calculate the time difference, distance, and speed difference between every two consecutive signal records. Then, we can determine whether the signal is a ping-pong signal or a drifting signal by evaluating these differences.
When the drifting signal occurs, the user is instantaneously switched to a base station far away. Therefore, the speed of two consecutive signal records would be relatively large. If it is greater than the minimum drift speed threshold V1, we can determine that a drifting signal has occurred, and the drifting signal needs to be deleted. The minimum drift speed threshold V1 is recommended to be set to 1.2 times the maximum speed limit of the regional road.
Because most of the ping-pong signal occurs in a short time, it is necessary to set a time threshold, T1. If the difference between a signal record and its previous signal record is greater than the time threshold T1, it is not considered to be ping-pong data. The threshold T1 for filtering ping-pong data is recommended to be 600 s. In order to eliminate all ping-pong signals, each signal record i should be compared with its previous signal record, i–1, and its following signal record i+1. If the previous signal record i–1 and the following signal record i+1 are in the same location, then the signal record i can be determined as ping-pong data and needs to be deleted.
(3)
Identify trip ends: If a user stays in a region with a high density of base stations for a long time, the telecommunication system may randomly switch the base station of the user to nearby base stations in order to balance the telecommunication traffic. Therefore, changes of base stations in a user’s signal sequence may not indicate a trip. To identify a user’s stay area and the duration of the stay, the following method is applied in our study: Select K pieces of signal records in sequence, and set the initial value to K=2, then calculate the coordinates of the center of gravity point C of those K points. If the distance from the K points to the center of gravity point C is less than the threshold C1, then the K records are considered to be a stay rather than a trip. Let K=K+1 and continue the loop, the loop is stopped until the distance from the newly added point to the center of gravity C is greater than C1. The threshold C1 is set to be 500 meters and the maximum value of K is set to 5. A higher K value can increase the accuracy but would increase the complexity of the algorithm.

3.3. Identify Urban Residents, Short-Term Travelers and Transfer Travelers

The urban residents and travelers obviously have different travel behaviors, since urban residents have a fixed home in the city, they would take more routine commuting trips while the trips made by travelers are more diversified. We define “short-term travelers” as the travelers who stay in the city for more than one day while “transfer” travelers don’t stay overnight in the city. Transfer travelers maybe passengers who pass through the city or travelers who return after completing their specific trip purposes. Short-term travelers may be business travelers or tourists who would dwell in hotels. The trip mode preferences and trip length distributions of short-term travelers and transfer travelers may have significant difference. Therefore, we classified urban trip makers into these three categories.
In order to distinguish urban residents, short-term traveler, and transfer travelers, we need to collect and process mobile phone signaling data for several consecutive days, and then count the number of days that mobile phone users appear in the city under study, and divide the users into the three groups according to the proportion of users staying in the research cycle.
In this research, the analysis period is 30 days, and if the user appears less than 1 day, the user is considered to be a transfer traveler. If the user appears more than 1 day but less than 30 × 30%=9 days in the analysis period, the user is regarded as a short-term traveler.

3.4. Trip Length Distribution Model

The trip length distribution model describes the percentages of trips at each separation of trip distances. The most frequently used probability distributions used in trip length frequency distribution models include Gamma, Weibull, Exponential, and Rayleigh distributions.
1. Gamma Distribution
The gamma distribution is a two-parameter continuous distribution with its origin being zero. Functionally it is represented as follows [20,21]:
f x = b b a Γ a x a 1 e x b
where, a refers to the shape parameter, b is the scale parameter, and f(x) refers to the number of occurrence of trips of distance x.
2. Weibull Distribution
The Weibull distribution is generally represented as a three-parameter distribution, similarly to the gamma distribution, it becomes a two-parameter continuous distribution when the origin is zero. For strictly positive values of the shape parameter b and scale parameter a, it is represented as follows [22]:
f x = b a x a b 1 e x a b
3. Exponential Distribution
The exponential distribution is a commonly used distribution in reliability engineering, exhibiting a constant failure rate characteristic with respect to operating time. The exponential distribution is a special case of gamma distribution (obtained by setting a = 1). The exponential distribution is special because of its utility in modeling events that occur randomly over time—it is represented as follows [23]:
f x = 1 μ e x μ
where μ represents mean trip distance.
4. Rayleigh Distribution
The Rayleigh distribution is a special case of the Weibull distribution. If a and b are the parameters of the Weibull distribution [24], then the Rayleigh distribution with parameter b is equivalent to the Weibull distribution with parameters a = 2 b and b = 2. The Rayleigh distribution is represented as:
f x = x b 2 e x 2 2 b 2

3.5. Evaluation Methods

The value of the coefficient of determination (R-squared) is used in this research as the primary criterion in evaluating how well the theoretical distributions matched the observed distributions. The R-squared value is calculated using the following equation:
R 2 = x i x ¯ y i y ¯ σ x × σ y 2
where x i refers to the actual observed number of trips at a certain distance i, y i the number of trips calculated by the model at a certain distance i ,   x ¯ is the mean value of x i ,   y ¯ the mean value of y i ,   σ x the standard deviation of x i , and σ y is the standard deviation of y i .
The value of R-squared would fall between 0 and 1. A value of R-squared equal to 1 would imply that the model provides perfect predictions. The closer the value is to one indicates a greater ability to predict.
We use a two-sample Kolmogorov–Smirnov test method(KS test) in order to test whether two datasets belong to the same distribution. The two-sample KS test calculates the absolute distances between the cumulative distribution functions of the distributions of the two datasets. The maximum distance is then plugged into the KS probability function to calculate the probability value p. The test statistic is:
D * = max F 1 ^ x F 2 ^ x
where F 1 ^ x is the proportion of x1 values less than or equal to x and F 2 ^ x is the proportion of x2 values less than or equal to x. The lower the probability value p indicates the less likely the two distributions are similar.
In addition, a visual comparison of the actual trip length distribution and the predicted trip length trip distributions would also be used as an auxiliary measure to compare the quality of the models.

4. Results

In this section, we investigated the performance of the four distribution models introduced in the previous section in modeling trip length distributions for various traveler groups, including permanent residents, short-term travelers and transfer travelers in various time periods.
We have analyzed the cellphone signal data for a whole month in October 2016. About 150 million signaling records are processed for each day using the method presented in the previous section on a Hadoop system. We have identified 2.32, 0.664, and 0.193 million average daily resident users, short-term traveler users, and transfer users, respectively. The market penetration rate of the cellular service provider is about 62% in Nanjing and 45% of the users’ cellular records are collected by our system. Therefore, our sample size is about 28% of the population and we need to multiply 1/0.62/0.45 = 3.584 in order to calculate the total population in the city. As indicated in the 2016 survey, there are 8.274 million residents in Nanjing, which is very close to our result, 2.32 × 3.584 = 8.315 million. We also compared the number of residents calculated by the cellular data in each administrative zone with the survey results. As indicated in Figure 4, the maximum error in a zone is about 15%. Although the cellular data tend to underestimate the population, after calibrating the general error is very close to the survey.
Figure 4. Population in the Administrative Zone collected by Survey and Cellular Data.
Figure 5 displays the number of trips made by residents, short-term travelers, and transfer travelers during the 31 days. As indicated in the figure, residents contribute about half the amount of trips, while short-term travelers constitute more than 30% percent of the total trips, and transfer trips contribute about 10% of the total trips. The figure also suggests that the trip pattern differs significantly from holidays and weekends to normal weekdays, where the period between October 1 and October 7 belongs to the Chinese national holidays. Trips made by the residents are less in the holidays and weekends than in the normal weekdays. The transfer trips are higher in the holidays when short-term traveler trips are more frequent. The difference in short-term trips between holidays, weekends, and normal weekdays is not so significant compared with the other two groups of travelers.
Figure 5. Temporal Characteristics of Trips Made by Heterogeneous Population.

4.1. Daily Trip Length Distribution Models

Firstly, the trips are aggregated by 100 meters of trip distances, and resulted in 500 groups of trip distances in the range of 0–50 km. Then, the trip length distributions of the various population groups are compared using the Kolmogorov–Smirnov method in order to test whether they follow the same distribution statistically. Table 1 displays the KS test statistics of the pairwise comparison of the trip length distributions of the three population groups at the 5% significance level, the p-value of all the tests are close to 0 and h = 1, which indicates that the null hypothesis—that the trip distributions are from the same continuous distributions—is rejected.
Table 1. Kolmogorov–Smirnov Test Statistics for the Trip Length Distributions of Various Groups.
Figure 6 illustrates the trip length distribution characteristics of various population groups. As shown in Figure 6, the trip distances travelled by residents tends to be longer than transfer travelers and short-term travelers. The trip distances of short-term travelers are the lowest among all three population groups.
Figure 6. Trip Length Distribution of Various Population Groups.
As indicated in Table 1, as well as Figure 5 and Figure 6, the trip characteristics of the residents, short-term travelers, and transfer travelers differ significantly in terms of travel time and travel distances. Therefore, when modelling the trip length distribution characteristics, the heterogeneity in the travel behavior of different population groups should be considered. Therefore, we built separate trip length distribution models for each type of population group. We tested the four most popular models, including the exponential model, the Rayleigh model, the gamma model and the Weibull model in order to compare and select the best one—the results are listed in Figure 7.
Figure 7. Daily Trip Distance Distribution Models for Heterogeneous Population Groups.
As shown in Figure 7, the gamma and Weibull model methods outperform the exponential and Rayleigh methods in estimating the trip length distributions of all three population groups. The advantage in estimation accuracy could be attributed to the extra number of parameters in the gamma and Weibull model methods. For the two parameter models, the Weibull model method has slightly better advantage while in the one parameter models, the exponential model method is obviously better than the Rayleigh model method. In conclusion, if only one model parameter is allowed in the daily trip length distribution model, the exponential model method should be used, and if the number of model parameters is not so sensitive, the Weibull model method is recommended.

4.2. Within-Day Trip Distances Distribution Models

From the analysis in the previous section, the trip patterns of the residents, short-term travelers, and transfer travelers vary with different time periods in a day. In traditional transportation planning applications, the daily trip distance distribution model is fundamental in the gravity model [25,26]. When it comes to dynamic trip demand estimation, the time-varying property of the trip length function should be taken into account. This section presents the within-day trip distance distribution models.
Firstly, we compared the weekday and weekend travel patterns in Figure 8 to find that residents and short-term travelers tend to travel more on weekdays than on weekends, while for transfer travelers the difference is not significant.
Figure 8. Trip Distance Distributions on Weekdays and Weekends.
In order to examine the fluctuation in the hourly trip length distributions, we plotted the hourly trip length distribution of the residents in Figure 9 (the hourly trip distribution of the short-term travelers and transfer travelers can be found in Figure A1 and Figure A2 of the Appendix A). The red dots refer to the weekday trips, while the blue dots represent the weekend trips. As indicated in Figure 9, the trip length distribution pattern can be approximately divided into two categories. In the late night and early morning, trips tend to be shorter—meanwhile, in the daytime, there are longer trips.
Figure 9. Hourly Trip Length Distribution of the Resident Group.
Although developing separate models for each hour may be more accurate numerically, the complexity of the models would bring difficulty in practice. We adopted the hieratical agglomerative clustering method to group similar hours, so the similar hours in the same group could share the same trip length distribution model. The hourly trip length distribution model can be classified into six scenarios according to the population group (resident/short-term traveler/transfer traveler) and by weekday/weekend. Dendrograms of the hourly trip length distribution of the six scenarios are plotted in Figure 10. For all of the six scenarios, the hours can obviously be classified into two groups: the morning hours group and the night hours group, but there are slightly differences in the division of the hours. For residents, the hours in the weekdays are divided into morning hours (7:00–21:00) and night hours (1:00–6:00; 22:00–24:00), the hours in the weekends are divided into morning hours (7:00–22:00) and night hours (1:00–6:00; 23:00–24:00). For short-term travelers, the hours in the weekdays and weekends are both divided into morning hours (8:00–19:00) and night hours (1:00–7:00; 20:00–24:00). For transfer travelers, the hours in the weekdays are divided into morning hours (8:00-19:00) and night hours (1:00–7:00; 20:00–24:00), and the hours in the weekends are divided into morning hours (7:00–22:00) and night hours (1:00–6:00; 23:00–24:00).
Figure 10. Dendrogram of Hourly Trip Length Distribution.
In order to test whether the division of the six scenarios is reasonable, we use the KS method to test the differences in the distributions for various weekday/weekend and morning/night combinations. The test results are displayed in Table 2, and indicate that all the pairwise comparisons are not in the same continuous distribution at the 5% significance level. Therefore, the division of the six scenarios is plausible and we can build separate trip length distribution models for each scenario.
Table 2. Kolmogorov–Smirnov Test Statistics for the Various Hourly Trip Length Distributions.
Similar to the process to obtain daily trip length distribution models, we compared the four models, including the exponential model, the Rayleigh model, the gamma model, and the Weibull model to compare and select the best within-day trip length distribution model for the forementioned 12 conditions. The R-squared values of the results of these four models under 12 conditions are listed in Table 3. Similarly, the Weibull model is found to have the best accuracy and, for one-parameter models, the exponential model produces the better results. The model parameters of the Weibull model and the exponential model are listed in Table 4.
Table 3. R-Squared Values of the Trip Length Distribution Models.
Table 4. Parameters of the Weibull-based Trip Length Distribution Models.

5. Conclusions

This paper explores trip length distribution functions for heterogeneous population groups utilizing cellphone signaling data at a large-scale (5 million individuals). We investigated the feasibility of using four models, including the exponential model, the Weibull model, the Rayleigh model, and the gamma model, in modeling the daily trip length distributions for heterogeneous population groups. We found that the trip distributions are different for the three distinct groups of residents, namely residents, short-term travelers, and the transfer travelers. In addition, we investigated the statistical variations of trip distribution models caused by different time periods and found that the trip distributions for the daytime and nighttime are distinct. We also found differences in the trip distribution patterns between weekdays and the weekend. In order to improve the accuracy of the models, we proposed a clustering method to group different hours in a day, and proposed separate trip length distribution models for different partitions of the day for each traveler group. Our results suggested that the gamma distribution performs well in trip length distribution modeling under various conditions, and the exponential model is also suitable for trip length distribution models when the number of model parameters is sensitive in application.
Our work extends the traditional trip length distribution method with a new perspective on the heterogeneity of the traveler group and the different time periods, and provides an important decision-making basis for urban transportation planning. However, we still recognize two major drawbacks that arise while using cellular phone data to model urban mobility patterns. Firstly, the market share of the mobile phone operator may be uneven for different parts of the city, which can affect the statistical results. Other data sources, such as loop detector data or video surveillance data, could be used to calibrate the market penetration rate to improve the accuracy. Secondly, cellular phone data could not provide users’ personal social-economical information due to privacy issues, which is indispensable when analyzing the impact of individuals’ travel behavior.
Addressing these aforementioned limitations will be part of our future work. In addition, the study of trip length distribution characteristics also has the following applications and research prospects.
(1) It can be used as the impedance function of the trip distribution model to improve the transportation planning procedure to obtain origin–destination matrices of better precision for residents, short-term travelers, and transfer travelers. The results could provide supportive information for transportation agencies in the traffic management procedure, such as traffic control in holidays and special events and policy making in license plate number restrictions, etc.
(2) When planning for new roadways, the trip volume and the travel probability under a certain trip distance can be used as key indicators in determining the road function class.
(3) This method could also be applied in other cities where cellular data is available to explore the trip length distribution pattern in cities of various gross domestic products, populations, area size, and so forth, and identify the influencing factors.

Author Contributions

Methodology, F.Y.; data curation, Z.Y.; writing—original draft preparation, F.Y.; writing—review and editing, H.T.; visualization, F.D.; supervision, B.R.

Funding

This research was partially funded by National Natural Science Foundation of China, grant number 71701044, Projects of International Cooperation and Exchange of the National Natural Science Foundation of China (No. 51561135003), and Fundamental Research Funds for the Central Universities (No. 300102219301).

Conflicts of Interest

The authors declare no conflict of interest

Appendix A

The hourly trip distribution of short-term travelers and transfer travelers are illustrated in Figure A1 and Figure A2. The red dots refer to the weekday trips and the blue dots represent weekend trips.
Figure A1. Hourly Trip Length Distribution of the Short-Term Traveler Group.
Figure A2. Hourly Trip Length Distribution of the Transfer Traveler Group.

References

  1. Zhang, J.; Zheng, Y.; Qi, D. Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction. In Proceedings of the In Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2016. [Google Scholar]
  2. Hoang, M.X.; Yu, Z.; Singh, A.K. FCCF: Forecasting citywide crowd flows based on big data. In Proceedings of the 24th ACM SIGSPATIAL International Conference, San Francisco, CA, USA, 31 October–3 November 2016. [Google Scholar]
  3. Zhang, D.; Huang, J.; Li, Y.; Zhang, F.; Xu, C.; He, T. Exploring human mobility with multi-source data at extremely large metropolitan scales. In Proceedings of the International Conference on Mobile Computing & Networking, Paris, France, 7–11 September 2015. [Google Scholar]
  4. Yuan, J.; Zheng, Y.; Xie, X. Discovering regions of different functions in a city using human mobility and POIs. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, Beijing, China, 2012. [Google Scholar]
  5. González, M.C.; Hidalgo, C.A.; Barabasi, A.-L. Understanding individual human mobility patterns. Nature 2008, 453, 779–782. [Google Scholar] [CrossRef] [PubMed]
  6. Jonker, N.J.; Venter, C.J. Modeling Trip-Length Distribution of Shopping Center Trips from GPS Data. Transp. Engin. Part A Sys. 2016. [Google Scholar] [CrossRef]
  7. Holguín-Veras, J.; Thorson, E. Trip Length Distributions in Commodity-Based and Trip-Based Freight Demand Modeling: Investigation of Relationships. Transp. Res. Rec. J. Transp. Res. Board 2000, 1707, 37–48. [Google Scholar] [CrossRef]
  8. Noulas, A.; Scellato, S.; Lambiotte, R.; Pontil, M.; Mascolo, C. Correction: A Tale of Many Cities: Universal Patterns in Human Urban Mobility. PLoS ONE 2012, 7, 37027. [Google Scholar] [CrossRef]
  9. Brockmann, D.; Hufnagel, L.; Geisel, T. The scaling laws of human travel. Nature 2006, 439, 462–465. [Google Scholar] [CrossRef] [PubMed]
  10. Alessandretti, L.; Sapiezynski, P.; Lehmann, S.; Baronchelli, A. Multi-scale spatio-temporal analysis of human mobility. PLoS ONE 2017, 12, 0171686. [Google Scholar] [CrossRef]
  11. Liu, T.; Yang, Z.; Zhao, Y.; Wu, C.; Zhou, Z.; Liu, Y. Temporal understanding of human mobility: A multi-time scale analysis. PLoS ONE 2018, 13, e0207697. [Google Scholar] [CrossRef] [PubMed]
  12. The World Bank. Mobile cellular subscriptions (per 100 people). 2017. Available online: https://data.worldbank.org.cn/indicator/IT.CEL.SETS.P2 (accessed on 3 October 2019).
  13. Iqbal, M.S.; Choudhury, C.F.; Wang, P.; González, M.C. Development of origin-destination matrices using mobile phone call data. Transp. Res. 2014, 40, 63–74. [Google Scholar] [CrossRef]
  14. Calabrese, F.; Lorenzo, G.; Liu, L.; Ratti, C. Estimating Origin-Destination Flows Using Mobile Phone Location Data. IEEE Perv. Comp. 2011, 10, 36–44. [Google Scholar]
  15. Calabrese, F.; Colonna, M.; Lovisolo, P.; Parata, D.; Ratti, C. Real-Time Urban Monitoring Using Cell Phones: A Case Study in Rome. IEEE Trans. Intel. Transp. Sys. 2011, 12, 141–151. [Google Scholar]
  16. Calabrese, F.; Diao, M.; Di Lorenzo, G.; Ferreira, J., Jr.; Ratti, C. Understanding individual mobility patterns from urban sensing data: A mobile phone trace example. Transp. Res. Part C: Emerg. Tech. 2013, 26, 301–313. [Google Scholar]
  17. Sagl, G.; Loidl, M.; Beinat, E. A Visual Analytics Approach for Extracting Spatio-Temporal Urban Mobility Information from Mobile Network Traffic. Isprs. Int J. Geo.-Inf. 2012, 1, 256–271. [Google Scholar] [CrossRef]
  18. Ranjan, G.; Zang, H.; Zhang, Z.L.; Bolot, J. Are call detail records biased for sampling human mobility? ACM SIGMOBILE 2012, 16, 33–44. [Google Scholar] [CrossRef]
  19. Sagl, G.; Delmelle, E.; Delmelle, E. Mapping collective human activity in an urban environment based on mobile phone data. Cartogr. Geogr. Inf. Sci. 2014, 41, 272–285. [Google Scholar] [CrossRef]
  20. Mathworks. The Gamma Distribution. 2019. Available online: https://www.mathworks.com/help/stats/gamma-distribution.html (accessed on 3 October 2019).
  21. Chen, D.; Fu, X.; Ding, W.; Li, H.; Xi, N.; Wang, Y. Shifted gamma distribution and long-range prediction of round trip timedelay for internet-based teleoperation. In Proceedings of the 2008 IEEE International Conference on Robotics and Biomimetics, Bangkok, Thailand, 22–25 February 2008. [Google Scholar]
  22. Chen, W.; Gao, Q.; Xiong, H.G. Uncovering urban mobility patterns and impact of spatial distribution of places on movements. Inter. J. Modern Phy. C. 2017, 28, 25. [Google Scholar] [CrossRef]
  23. Evans, A.W. The calibration of trip distribution models with exponential or similar cost functions. Transp. Res. 1971, 5, 15–38. [Google Scholar] [CrossRef]
  24. Weik, M.H. Rayleigh distribution. In Encyclopedia of Statistical Sciences; Wiley: Hoboken, NJ, USA, 2016; p. 1416. [Google Scholar]
  25. Khadaroo, J.; Seetanah, B. The Role of Transport Infrastructure in International Tourism Development: A Gravity Model Approach. Tour. Manag. 2008, 29, 831–840. [Google Scholar] [CrossRef]
  26. Karemera, D.; Oguledo, V.I.; Davis, B. A gravity model analysis of international migration to North America. Appl. Econ. 2000, 32, 1745–1755. [Google Scholar] [CrossRef]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.