Collecting Typhoon Disaster Information from Twitter Based on Query Expansion
AbstractSocial media is a popular source of volunteered geographic information owing to its massive real-time data; however, the use of social media data in the context of geospatial analysis is challenging because complex semantic filters are required for the aggregation of geographic messages from the data streams. This article proposes a new query expansion method for social media streams which updates the query keywords periodically by the words extracted from the preceding search results. The proposed method has optimized the trade-off between precision and coverage of geographical messages by factoring in the influences of the keyword number and refresh cycle in the query process, and some improvements on the classic Term Frequency-Inverse Document Frequency (TF-IDF) method for short texts were achieved. Furthermore, a number of filters based upon relevance to the target topic were established and tested. This method was tested on a dataset from Twitter within the geographic extent of Macau in August 2017 during two consecutive typhoon hits. The result supports its effectiveness with a controllable precision and considerable increment of relevant information. Moreover, the query keywords can adjust themselves to the local language environment by discovering new keywords. To conclude, this query expansion method is able to provide a reliable method for social media-based information retrieval. View Full-Text
Share & Cite This Article
Chen, Z.; Lim, S. Collecting Typhoon Disaster Information from Twitter Based on Query Expansion. ISPRS Int. J. Geo-Inf. 2018, 7, 139.
Chen Z, Lim S. Collecting Typhoon Disaster Information from Twitter Based on Query Expansion. ISPRS International Journal of Geo-Information. 2018; 7(4):139.Chicago/Turabian Style
Chen, Zi; Lim, Samsung. 2018. "Collecting Typhoon Disaster Information from Twitter Based on Query Expansion." ISPRS Int. J. Geo-Inf. 7, no. 4: 139.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.