Sentiment Analysis on Multimodal Transportation during the COVID-19 Using Social Media Data
Abstract
:1. Introduction
1.1. Related Work
1.2. Contributions of This Paper
2. Data Collection
2.1. Social Media Data
- Subway: subway, metroline, path, MTA, LIRR, shuttle, train, light rail, transit.
- Bus: bus, ferry, ferries, public transport.
- Car: taxi, car, vehicle, parking, cab, Uber, Lyft.
- Bike: bike, citibike, bicycle, bike share.
2.2. NYC Open Data
- Subway: NYC subway turnstile data provides the number of exits/entries in subway stations (http://web.mta.info/developers/turnstile.html, accessed on 1 March 2022).
- Bike: (1) Citibike provides the number of bike trips (https://ride.citibikenyc.com/system-data, accessed on 1 March 2022). (2) DOT (department of transportation) provides the aggregate bike usage in NYC (https://www1.nyc.gov/html/dot/html/bicyclists, accessed on 1 March 2022).
- Taxi: TLC (Taxi & Limousine Commission) trip record provides the number of taxi trips (https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page, accessed on 1 March 2022).
3. Methodology
3.1. Data Preprocessing
3.2. Travel Mode Classifier
3.3. Sentiment Classifier
4. Results
- how people’s attitudes toward travel mode choices change during the pandemic; and
- how users’ demographics impact their attitude toward mode choices.
4.1. Sentiment Analysis on Travel Mode
- When the stay-at-home order began (March 2020), the number of tweets related to all travel modes and the mobility usage drastically decreased because travel demand decreased.
- When the reopening phase began (June 2020), the number of positive tweets related to bus, bike, and private vehicles increased. Users believed that these travel modes are reliable during the pandemic and many commuters shifted from subways to buses, bikes, and private vehicles.
- People were worried about being affected by those who do not wear masks on subways and buses. Public concerns about other modes (bikes, taxis/Ubers, and private vehicles) were about persistent issues. Many users cared about street conditions, parking spaces, bike lane usage, and the price of ride hailing.
4.2. Relationship between Sentiment and User Demographics
- Multinomial logistic regression: In the multinomial logistic regression model, we use the softmax function to normalize all features.
- Random forest: This is a tree-based model that ensembles all predictions from many decision trees by ranking the predictions.
- XGBoost: This stands for gradient boosted trees, which apply gradient descent methods to produce a strong prediction model from an ensemble of weak prediction models like decision trees.
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Goldbaum, C.; Hu, W. Why New York Buses Are on the Rise in a Subway City. 6 July 2020. Available online: https://www.nytimes.com/2020/07/06/nyregion/mta-buses-nyc-coronavirus.html (accessed on 11 January 2020).
- Chen, X.; Shea, R.; Di, X. Travel Pattern Analysis on Switching Behavior in Response to the COVID-19 Pandemic. arXiv 2023. submitted. [Google Scholar]
- MTA. Subway and Bus Ridership for 2019. 2020. Available online: https://new.mta.info/agency/new-york-city-transit/subway-bus-ridership-2019 (accessed on 11 January 2020).
- Cuba, J. NYPD: Bike Injuries Are Up 43 Percent during Coronavirus Crisis. 19 March 2020. Available online: https://nyc.streetsblog.org/2020/03/25/experts-senate-bills-25b-for-transit-wont-be-enough/ (accessed on 11 January 2020).
- Goldbaum, C. Is the Subway Risky? It May Be Safer than You Think. 2 August 2020. Available online: https://www.nytimes.com/2020/08/02/nyregion/nyc-subway-coronavirus-safety.html?referringSource=articleShare (accessed on 8 March 2020).
- Stringer, S.M. New York City’s Frontline Workers. 26 March 2020. Available online: https://comptroller.nyc.gov/reports/new-york-citys-frontline-workers/ (accessed on 11 January 2020).
- Zuo, F.; Wang, J.; Gao, J.; Ozbay, K.; Ban, X.J.; Shen, Y.; Yang, H.; Iyer, S. An interactive data visualization and analytics tool to evaluate mobility and sociability trends during COVID-19. arXiv 2020, arXiv:2006.14882. [Google Scholar]
- Bernardes, S.D.; Bian, Z.; Thambiran, S.S.M.; Gao, J.; Na, C.; Zuo, F.; Hudanich, N.; Bhattacharyya, A.; Ozbay, K.; Iyer, S.; et al. NYC Recovery at a Glance: The Rise of Buses and Micromobility. arXiv 2020, arXiv:2009.14019. [Google Scholar]
- Kamga, C.; Moghimi, B.; Vicuna, P.; Mudigonda, S.; Tchamna, R. Mobility Trends in New York City during COVID-19 Pandemic: Analyses of Transportation Modes throughout May 2020; University Transportation Research Center: New York, NY, USA, 2020. [Google Scholar]
- Wang, D.; He, B.Y.; Gao, J.; Chow, J.Y.; Ozbay, K.; Iyer, S. Impact of COVID-19 Behavioral Inertia on Reopening Strategies for New York City Transit. arXiv 2020, arXiv:2006.13368. [Google Scholar] [CrossRef]
- Hasan, S.; Ukkusuri, S.V. Urban activity pattern classification using topic models from online geo-location data. Transp. Res. Part Emerg. Technol. 2014, 44, 363–381. [Google Scholar] [CrossRef]
- Zheng, X.; Chen, W.; Wang, P.; Shen, D.; Chen, S.; Wang, X.; Zhang, Q.; Yang, L. Big data for social transportation. IEEE Trans. Intell. Transp. Syst. 2015, 17, 620–630. [Google Scholar] [CrossRef]
- Lee, J.H.; Davis, A.W.; Yoon, S.Y.; Goulias, K.G. Activity space estimation with longitudinal observations of social media data. Transportation 2016, 43, 955–977. [Google Scholar] [CrossRef]
- Zhang, Z.; He, Q.; Zhu, S. Potentials of using social media to infer the longitudinal travel behavior: A sequential model-based clustering method. Transp. Res. Part Emerg. Technol. 2017, 85, 396–414. [Google Scholar] [CrossRef]
- Shou, Z.; Cao, Z.; Di, X. Similarity Analysis of Spatial-Temporal Mobility Patterns for Travel Mode Prediction Using Twitter Data. In Proceedings of the 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece, 20–23 September 2020; pp. 1–6. [Google Scholar] [CrossRef]
- Chen, X.; Di, X. How the COVID-19 Pandemic Influences Human Mobility? Similarity Analysis Leveraging Social Media Data. In Proceedings of the 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau, China, 8–12 October 2022; pp. 2955–2960. [Google Scholar]
- Yao, W.; Qian, S. From Twitter to traffic predictor: Next-day morning traffic prediction using social media data. Transp. Res. Part Emerg. Technol. 2021, 124, 102938. [Google Scholar] [CrossRef]
- Ye, Q.; Ozbay, K.; Zuo, F.; Chen, X. Impact of Social Media Use on Travel Behavior during COVID-19 Outbreak: Evidence from New York City. Transp. Res. Rec. 2020, grc-747481. [Google Scholar]
- Rahman, M.M.; Ali, G.N.; Li, X.J.; Samuel, J.; Paul, K.C.; Chong, P.H.; Yakubov, M. Socioeconomic factors analysis for COVID-19 US reopening sentiment with Twitter and census data. Heliyon 2021, 7, e06200. [Google Scholar] [CrossRef] [PubMed]
- Sadri, A.M.; Hasan, S.; Ukkusuri, S.V.; Cebrian, M. Exploring network properties of social media interactions and activities during Hurricane Sandy. Transp. Res. Interdiscip. Perspect. 2020, 6, 100143. [Google Scholar] [CrossRef]
- Wong, A.; Ho, S.; Olusanya, O.; Antonini, M.V.; Lyness, D. The use of social media and online communications in times of pandemic COVID-19. J. Intensive Care Soc. 2020, 22, 255–260. [Google Scholar] [CrossRef] [PubMed]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Minneapolis, MN, USA, 2–7 June 2019. [Google Scholar]
- Chen, X.; Zeng, H.; Xu, H.; Di, X. Sentiment Analysis of Autonomous Vehicles After Extreme Events Using Social Media Data. In Proceedings of the 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 19–22 September 2021; pp. 1211–1216. [Google Scholar]
- Szymkowski, S. Uber and Lyft Ride Costs Have Skyrocketed Amid the Pandemic. 2021. Available online: https://www.cnet.com/roadshow/news/uber-lyft-ride-costs-pandemic/ (accessed on 1 March 2022).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, X.; Wang, Z.; Di, X. Sentiment Analysis on Multimodal Transportation during the COVID-19 Using Social Media Data. Information 2023, 14, 113. https://doi.org/10.3390/info14020113
Chen X, Wang Z, Di X. Sentiment Analysis on Multimodal Transportation during the COVID-19 Using Social Media Data. Information. 2023; 14(2):113. https://doi.org/10.3390/info14020113
Chicago/Turabian StyleChen, Xu, Zihe Wang, and Xuan Di. 2023. "Sentiment Analysis on Multimodal Transportation during the COVID-19 Using Social Media Data" Information 14, no. 2: 113. https://doi.org/10.3390/info14020113
APA StyleChen, X., Wang, Z., & Di, X. (2023). Sentiment Analysis on Multimodal Transportation during the COVID-19 Using Social Media Data. Information, 14(2), 113. https://doi.org/10.3390/info14020113