A Survey of Data Representation for Multi-Modality Event Detection and Evolution
Abstract
:Featured Application
Abstract
1. Introduction
2. Methodology
2.1. Retrieval Strategy
2.2. Selection Strategy
3. The Notion of Event Detection and Evolution
4. Single-Modality Data Representation and Event Detection
4.1. Text Based Data Representation
4.1.1. Term Based Methods
4.1.2. Topic Model Based Methods
4.1.3. Graph Based Methods
4.1.4. Deep Learning Based Methods
4.2. Visual Based Data Representation and Event Detection
4.2.1. Image Based Data Representation
4.2.2. Video Based Data Representation
5. Multi-Modality Data Representation and Event Detection
5.1. Feature Fusion Based Methods
5.2. Matrix Factorization Based Methods
5.3. Topic Model Based Methods
5.4. Deep Learning Based Methods
5.5. Other Methods
6. Multi-Modality Event Evolution
7. Datasets and Evaluation
7.1. Datasets
- (1)
- Social Event Detection (SED) dataset. The SED datasets are provided by the MediaEval challenge. The goal of this task is to discover social events of interest from the mass of user-generated Flickr multi-modality content and the metadata surrounding it. It released four subsets in 2011–2014: SED2011 [113], SED2012 [114], SED2013 [115], SED2014 [116]. Different years released different challenges. The statistic information of the datasets is shown in Table 5.
- (2)
- Multi-modality Social Event Dataset. This dataset is a multi-modality social event dataset (short for MMSE) downloaded from Flickr and has different versions. The datasets are composed of image and text data. The first version was released in 2014 and contains 36,000 documents that consist of 10 types of events [97], the second version was also released in 2014 and contains 59,500 documents that consist of 12 types of events [117], the third version was released in 2015 and contains 107,600 documents that consist of 18 types of events [96], the latest version named HFUT-mmdata was released in 2019 and contains 74,364 documents that consist of 10 types of events [118]. The details of these datasets are shown in Table 6.
7.2. Evaluation
8. Conclusions
9. Future Work
- (1)
- Take advantage of the diversity of multi-modality data. Multi-modality data types are not limited to modality diversity, such as textual and visual information, but also include social links and user behavior information such as comments and repost. Therefore, we should consider how to use this information in future research;
- (2)
- Event evolution problems. Although there have been many topic evolution studies on multi-modality data, which have yielded valuable results, there are still some problems such as no standard model evaluation benchmark for multi-modality topic evolution models. Therefore, an evaluation method for multi-modality topic evolution models should be proposed for the performance evaluation;
- (3)
- Lack of public datasets. Although there are some public datasets for multi-modality event detection, such as SED 2011–SED 2014 and multi-modality social event datasets mentioned above, these datasets still lack diversity. Therefore, it is necessary to develop more diversified datasets that contains more attribute information and conduct more comprehensive multi-modality event detection in the future;
- (4)
- Improve the interpretability of event detection models. The current event detection models are mostly black-box models. The model and results lack reasonable explanation. Therefore, we should focus on how to improve the interpretability of event detection models in the future;
- (5)
- Improve the data representation for multi-modality data. In existing multi-modality topic detection methods, modality fusion is mostly based on simple feature concatenation. In addition, time synchronization in different modalities is not considered. What is more, existing representations also have high time complexity, which is what we need to research in the future.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Hu, L.; Zhang, B.; Hou, L.; Li, J. Adaptive online event detection in news streams. Knowl. Based Syst. 2017, 138, 105–112. [Google Scholar] [CrossRef]
- Sayyadi, H.; Raschid, L. A graph analytical approach for topic detection. ACM Trans. Internet Technol. 2013, 132, 1–23. [Google Scholar] [CrossRef]
- Spina, D.; Gonzalo, J.; Amigó, E. Learning similarity functions for topic detection in online reputation monitoring. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, Gold Coast, Australia, 6–11 July 2014; pp. 527–536. [Google Scholar]
- Liu, H.; Li, B.W. Hot topic detection research of internet public opinion based on affinity propagation clustering. Lect. Notes Electr. Eng. 2012, 107, 261–269. [Google Scholar]
- Sakaki, T.; Okazaki, M.; Matsuo, Y. Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans. Knowl. Data Eng. 2012, 25, 919–931. [Google Scholar] [CrossRef]
- Chen, K.Y.; Luesukprasert, L.; Chou, S. Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling. IEEE Trans. Knowl. Data Eng. 2007, 19, 1016–1025. [Google Scholar] [CrossRef]
- Mottaghinia, Z.; Feizi-Derakhshi, M.R.; Farzinvash, L.; Salehpour, P. A review of approaches for topic detection in Twitter. J. Exp. Theor. Artif. Intell. 2021, 33, 747–773. [Google Scholar] [CrossRef]
- Atefeh, F.; Khreich, W. A survey of techniques for event detection in twitter. Comput. Intell. 2015, 31, 132–164. [Google Scholar] [CrossRef]
- Garg, M.; Kumar, M. Review on event detection techniques in social multimedia. Online Inf. Rev. 2016, 40, 347–361. [Google Scholar] [CrossRef]
- Panagiotou, N.; Katakis, I.; Gunopulos, D. Detecting events in online social networks: Definitions, trends and challenges. In Solving Large Scale Learning Tasks. Challenges and Algorithms; Springer: Cham, Switzerland, 2016; pp. 42–84. [Google Scholar]
- Goswami, A.; Kumar, A. A survey of event detection techniques in online social networks. Soc. Netw. Anal. Min. 2016, 6, 1–25. [Google Scholar] [CrossRef]
- Tzelepis, C.; Ma, Z.; Mezaris, V.; Ionescu, B.; Kompatsiaris, I.; Boato, G.; Sebe, N.; Yan, S. Event-based media processing and analysis: A survey of the literature. Image Vis. Comput. 2016, 53, 3–19. [Google Scholar] [CrossRef] [Green Version]
- Zeppelzauer, M.; Schopfhauser, D. Multimodal classification of events in social media. Image Vis. Comput. 2016, 53, 45–56. [Google Scholar] [CrossRef] [Green Version]
- Zhou, H.; Yu, H.; Hu, R.; Hu, J. A survey on trends of cross-media topic evolution map. Knowl.-Based Syst. 2017, 124, 164–175. [Google Scholar] [CrossRef]
- Liu, T.; Xue, F.; Sun, J.; Sun, X. A survey of event analysis and mining from social multimedia. Multimed. Tools Appl. 2020, 79, 33431–33448. [Google Scholar] [CrossRef]
- Zhou, H.; Yin, H.; Zheng, H.; Li, X. A survey on multi-modal social event detection. Knowl.-Based Syst. 2020, 195, 105695. [Google Scholar] [CrossRef]
- Nallapati, R.; Feng, A.; Peng, F.; Allan, J. Event threading within news topics. In Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, Washington, DC, USA, 8–13 November 2004; pp. 446–453. [Google Scholar]
- Zhang, Y.; Duan, Y.; Huang, G.; Jiang, Y. A Survey on Topic Detection and Tracking Methods in Social Media. J. Chin. Inf. Processing 2019, 33, 1–10. [Google Scholar]
- James, A. Topic Detection and Tracking: Event Based Information Organization; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
- Salton, G.; Wong, A.; Yang, C.S. A vector space model for automatic indexing. Commun. ACM 1975, 18, 613–620. [Google Scholar] [CrossRef] [Green Version]
- Jones, K.S. A Statistical Interpretation of Term Specificity and Its Application in Retrieval. J. Doc. 1972, 28. Available online: http://www.staff.city.ac.uk/~sbrp622/idfpapers/ksj_orig.pdf (accessed on 1 November 2021).
- Salton, G.; Buckley, C. Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 1988, 24, 323–328. [Google Scholar] [CrossRef] [Green Version]
- Zhang, K.; Zi, J.; Wu, L.G. New event detection based on indexing-tree and named entity. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, The Netherlands, 23–27 July 2007; pp. 215–222. [Google Scholar]
- Mihalcea, R.; Tarau, P. Textrank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, Barcelona, Spain, 25–26 July 2004; pp. 404–411. [Google Scholar]
- Yang, Y.; Pierce, T.; Carbonell, J.G. A study of retrospective and on-line event detection. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 24–28 August 1998; pp. 28–36. [Google Scholar]
- Allan, J.; Papka, R.; Lavrenko, V. On-line new event detection and tracking. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 24–28 August 1998; pp. 37–45. [Google Scholar]
- Petrović, S.; Osborne, M.; Lavrenko, V. Streaming first story detection with application to twitter. In Proceedings of the Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, CA, USA, 2–4 June 2010; pp. 181–189. [Google Scholar]
- Sankaranarayanan, J.; Samet, H.; Teitler, B.E.; Ieberman, M.D.; Sperling, J. Twitterstand: News in tweet. In Proceedings of the 17th ACM Sigspatial International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 42–51. [Google Scholar]
- Vakali, A.; Giatsoglou, M.; Antaris, S. Social networking trends and dynamics detection via a cloud-based framework design. In Proceedings of the 21st International Conference on World Wide Web, Lyon, France, 16–20 April 2012; pp. 1213–1220. [Google Scholar]
- Brants, T.; Chen, F.; Farahat, A. A system for new event detection. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Toronto, ON, Canada, 28 July–1 August 2003; pp. 330–337. [Google Scholar]
- Liu, L.; Kang, J.; Yu, J.; Wang, Z. A comparative study on unsupervised feature selection methods for text clustering. In Proceedings of the 2005 International Conference on Natural Language Processing and Knowledge Engineering, Wuhan, China, 30 October–1 November 2005; pp. 597–601. [Google Scholar]
- Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
- Dumais, S.T.; Furnas, G.W.; Landuaer, T.K. Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 1990, 41, 391–407. [Google Scholar]
- Hofmann, T. Probabilistic latent semantic indexing. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA USA, 15–19 August 1999; pp. 50–57. [Google Scholar]
- Blei, D.M.; McAuliffe, J.D. Supervised topic models. Adv. Neur. Inf. Process Syst. 2010, 3, 327–332. [Google Scholar]
- Andrzejewski, D.; Zhu, X.; Craven, M. Incorporating domain knowledge into topic modeling via Dirichlet forest priors. In Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, QC, Canada, 14–18 June 2009; pp. 25–32. [Google Scholar]
- Blei, D.M.; Griffiths, T.L.; Jordan, M.I.; Tenenbaum, J.B. Hierarchical topic models and the nested Chinese restaurant process. In Advances Neural Information Processing Systems 16; MIT Press: Cambridge, MA, USA, 2004; Available online: https://proceedings.neurips.cc/paper/2003/file/7b41bfa5085806dfa24b8c9de0ce567f-Paper.pdf (accessed on 1 November 2021).
- Hou, L.; Li, J.; Wang, Z.; Tang, J.; Zhang, P.; Yang, R.; Zheng, Q. Newsminer: Multifaceted news analysis for event search. Knowl. Based Syst. 2015, 76, 17–29. [Google Scholar] [CrossRef]
- Li, Z.; Wang, B.; Li, M.; Ma, W. A probabilistic model for retrospective news event detection. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, 15–19 August 2005; pp. 106–113. [Google Scholar]
- Zhang, C.; Wang, H.; Cao, L. A hybrid term–term relations analysis approach for topic detection. Knowl.-Based Syst. 2016, 93, 109–120. [Google Scholar] [CrossRef]
- Chen, Q.; Guo, X.; Bai, H. Semantic-based topic detection using markov decision processes. Neurocomputing 2017, 242, 40–50. [Google Scholar] [CrossRef]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient estimation of word representations in vector space. arXiv 2013, arXiv:1301.3781. [Google Scholar]
- Hu, L.; Yu, S.; Wu, B.; Shao, C.; Li, X. A neural model for joint event detection and prediction. Neurocomputing 2020, 407, 376–384. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
- Bodrunova, S.S.; Orekhov, A.V.; Blekanov, I.S.; Lyudkevich, N.S.; Tarasov, N.A. Topic detection based on sentence embeddings and agglomerative clustering with markov moment. Future Internet 2020, 12, 144. [Google Scholar] [CrossRef]
- Imran, N.; Liu, J.; Luo, J.; Shah, M. Event Recognition from Photo Collections via Pagerank. In Proceedings of the 17th ACM International Conference on Multimedia, Beijing, China, 19–24 October 2009; pp. 621–624. Available online: https://dl.acm.org/doi/epdf/10.1145/1631272.1631371 (accessed on 1 November 2021).
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Bossard, L.; Guillaumin, M.; Van Gool, L. Event recognition in photo collections with a stopwatch HMM. In Proceedings of the IEEE International Conference on Computer Vision, Washington, DC, USA, 1–8 December 2013; pp. 1193–1200. [Google Scholar]
- Dao, M.S.; Dang-Nguyen, D.T.; De Natale, F.G.B. Robust event discovery from photo collections using Signature Image Bases (SIBs). Multimed. Tools Appl. 2014, 70, 25–53. [Google Scholar] [CrossRef]
- Ruocco, M.; Ramampiaro, H. A scalable algorithm for extraction and clustering of event-related pictures. Multimed. Tools Appl. 2014, 70, 55–88. [Google Scholar] [CrossRef]
- Papadopoulos, S.; Zigkolis, C.; Kompatsiaris, Y. Cluster-based Landmark and Event Detection on Tagged Photo Collections. IEEE Multimed. 2011, 18, 52–63. [Google Scholar] [CrossRef]
- Cooper, M.; Foote, J.; Girgensohn, A.; Wilcox, L. Temporal event clustering for digital photo collections. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 2005, 1, 269–288. [Google Scholar] [CrossRef]
- Guo, C.; Tian, X. Event recognition in personal photo collections using hierarchical model and multiple features. In Proceedings of the 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), Xiamen, China, 19–21 October 2015; pp. 1–6. [Google Scholar]
- Guo, C.; Tian, X.; Mei, T. Multigranular event recognition of personal photo albums. IEEE Trans. Multimed. 2017, 20, 1837–1847. [Google Scholar] [CrossRef]
- Kaneko, T.; Yanai, K. Event photo mining from twitter using keyword bursts and image clustering. Neurocomputing 2016, 172, 143–158. [Google Scholar] [CrossRef]
- Zaharieva, M.; Zeppelzauer, M.; Breiteneder, C. Automated social event detection in large photo collections. In Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval, Dallas, TX, USA, 16–20 April 2013; pp. 167–174. [Google Scholar]
- Hamrouni, A.; Ghazzai, H.; Frikha, M.; Massoud, Y. A photo-based mobile crowdsourcing framework for event reporting. In Proceedings of the 2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS), Dallas, TX, USA, 4–7 August 2019; pp. 198–202. [Google Scholar]
- Tang, K.; Yao, B.; Fei-Fei, L.; Koller, D. Combining the right features for complex event recognition. In Proceedings of the 2013 IEEE International Conference on Computer Vision, Sydney, Australia, 1–8 December 2013; pp. 2696–2703. [Google Scholar]
- Burghouts, G.J.; Geusebroek, J.-M. Performance evaluation of local colour invariants. Comput. Vis. Image Underst. 2008, 113, 48–62. [Google Scholar] [CrossRef]
- Lan, Z.Z.; Yang, Y.; Ballas, N.; Yu, S.; Haputmann, A. Resource constrained multimedia event detection. In International Conference on Multi Media Modeling; Springer: Cham, Switzerland, 2014; pp. 388–399. [Google Scholar]
- Tamrakar, A.; Ali, S.; Yu, Q.; Liu, J.; Javed, O.; Divakaran, A.; Cheng, H.; Sawhney, H. Evaluation of low-level features and their combinations for complex event detection in open source videos. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 16–21 June 2012; pp. 3681–3688. [Google Scholar]
- Ming, Y. Human activity recognition based on 3d mesh mosift feature descriptor. In Proceedings of the 2013 International Conference on Social Computing, Washington, DC, USA, 8–14 September 2013; pp. 959–962. [Google Scholar]
- Yang, Y.; Ma, Z.; Xu, Z.; Yan, S.; Hauptmann, A.G. How related exemplars help complex event detection in web videos? In Proceedings of the 2013 IEEE International Conference on Computer Vision, Sydney, Australia, 1–8 December 2013; pp. 2104–2111. [Google Scholar]
- Laptev, I. On space-time interest points. Int. J. Comput. Vis. 2005, 64, 107–123. [Google Scholar] [CrossRef]
- Klaser, A.; Marszałek, M.; Schmid, C. A spatio-temporal descriptor based on 3d-gradients. In Proceedings of the BMVC 2008-19th British Machine Vision Conference. British Machine Vision Association, Norwich, UK, 1–4 September 2008; Volume 275, pp. 1–10. [Google Scholar]
- Alamuru, S.; Jain, S. Video event detection, classification and retrieval using ensemble feature selection. Clust. Comput. 2021, 24, 2995–3010. [Google Scholar] [CrossRef]
- Merler, M.; Huang, B.; Xie, L.; Hua, G.; Natsev, A. Semantic model vectors for complex video event recognition. IEEE Trans. Multimed. 2011, 14, 88–101. [Google Scholar] [CrossRef] [Green Version]
- Gkalelis, N.; Mezaris, V.; Kompatsiaris, I. High-level event detection in video exploiting discriminant concepts. In Proceedings of the 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI), Madrid, Spain, 13–15 June 2011; pp. 85–90. [Google Scholar]
- Yu, Q.; Liu, J.; Cheng, H.; Divakaran, A.; Sawhney, H. Multimedia event recounting with concept based representation. In Proceedings of the 20th ACM International Conference on Multimedia, Nara, Japan, 29 October–2 November 2012; pp. 1073–1076. [Google Scholar]
- Gan, C.; Wang, N.; Yang, Y.; Yeung, D.Y.; Hauptmann, A.G. Devnet: A deep event network for multimedia event detection and evidence recounting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 2568–2577. [Google Scholar]
- Xu, Z.; Yang, Y.; Hauptmann, A.G. A discriminative CNN video representation for event detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1798–1807. [Google Scholar]
- Zha, S.; Luisier, F.; Andrews, W.; Srivastava, N.; Salakhutdinov, R. Exploiting image-trained CNN architectures for unconstrained video classification. arXiv 2015, arXiv:1503.04144. [Google Scholar]
- Ye, G.; Li, Y.; Xu, H.; Liu, D.; Chang, S.F. Eventnet: A large scale structured concept library for complex event detection in video. In Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 471–480. [Google Scholar]
- Tian, H.; Tao, Y.; Pouyanfar, S.; Chen, S.C.; Shyu, M.L. Multimodal deep representation learning for video classification. World Wide Web 2019, 22, 1325–1341. [Google Scholar] [CrossRef]
- Pouyanfar, S.; Chen, S.C. Automatic Video Event Detection for Imbalance Data Using Enhanced Ensemble Deep Learning. Int. J. Semant. Comput. 2017, 11, 85–109. [Google Scholar] [CrossRef]
- Xu, H.; Li, B.; Ramanishka, V.; Sigal, L.; Saenko, K. Joint event detection and description in continuous video streams. In Proceedings of the 2019 IEEE winter conference on applications of computer vision (WACV), Waikoloa Village, HI, USA, 7–11 January 2019; pp. 396–405. [Google Scholar]
- Zhang, L.; Xiang, X. Video event classification based on two-stage neural network. Multimed. Tools Appl. 2020, 79, 21471–21486. [Google Scholar] [CrossRef]
- Schinas, M.; Papadopoulos, S.; Petkos, G.; Kompatsiaris, Y.; Mitkas, P.A. Multimodal graph-based event detection and summarization in social media streams. In Proceedings of the 23rd ACM international conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 189–192. [Google Scholar]
- Brenner, M.; Izquierdo, E. Multimodal detection, retrieval and classification of social events in web photo collections. In Proceedings of the ICMR 2014 Workshop on Social Events in Web Multimedia (SEWM), Glasgow, UK, 2–4 April 2014; pp. 5–10. [Google Scholar]
- Sutanto, T.E.; Nayak, R. Admrg@ mediaeval 2013 social event detection. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop [CEUR Workshop Proceedings], Barcelona, Spain, 18–19 October 2013; Volume 1043, pp. 1–2. [Google Scholar]
- Yang, X.; Zhang, T.; Xu, C.; Hossain, M.S. Automatic visual concept learning for social event understanding. IEEE Trans. Multimed. 2015, 17, 346–358. [Google Scholar] [CrossRef]
- Mei, T.; Wang, B.; Hua, X.S.; Zhou, H.Q.; Li, S. Probabilistic multimodality fusion for event based home photo clustering. In Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, Toronto, ON, Canada, 9–12 July 2006; pp. 1757–1760. [Google Scholar]
- Singh, B.; Han, X.; Wu, Z.; Morariu, V.I.; Davis, L.S. Selecting relevant web trained concepts for automated event retrieval. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 13–16 December 2015; pp. 4561–4569. [Google Scholar]
- Xu, Z.; Tsang, I.W.; Yang, Y.; Ma, Z.; Hauptmann, A.G. Event detection using multi-level relevance labels and multiple features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 97–104. [Google Scholar]
- Ma, Y.; Li, Q.; Yang, Z.; Zheng, L.; Chan, A.B. An SVD-based Multimodal Clustering method for Social Event Detection. In Proceedings of the 2015 31st IEEE International Conference on Data Engineering Workshops, Seoul, Korea, 13–17 April 2015; pp. 202–209. [Google Scholar]
- Gupta, S.K.; Phung, D.; Adams, B.; Tran, T.; Venkatesh, S. Nonnegative shared subspace learning and its application to social media retrieval. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 25–28 July 2010; pp. 1169–1178. [Google Scholar]
- Guan, Z.; Zhang, L.; Peng, J.; Fan, J. Multi-view concept learning for data representation. IEEE Trans. Knowl. Data Eng. 2015, 27, 3016–3028. [Google Scholar] [CrossRef]
- Zhu, F.; Shao, L.; Yu, M. Cross-modality submodular dictionary learning for information retrieval. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, Shanghai, China, 3–7 November 2014; pp. 1479–1488. [Google Scholar]
- Xu, X.; Yang, Y.; Shimada, A.; Taniguchi, R.I.; He, L. Semi-supervised coupled dictionary learning for cross-modal retrieval in internet images and texts. In Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 847–850. [Google Scholar]
- Xue, Z.; Li, G.; Zhang, W.; Pang, J.; Huang, Q. Topic detection in cross-media: A semi-supervised co-clustering approach. Int. J. Multimed. Inf. Retr. 2014, 3, 193–205. [Google Scholar] [CrossRef]
- Yu, Z.; Wu, F.; Yang, Y.; Tian, Q.; Luo, J.; Zhuang, Y. Discriminative coupled dictionary hashing for fast cross-media retrieval. In Proceedings of the ACM SIGIR Conference Research and Development Information Retriev, Gold Coast, Australia, 6–11 July 2014; pp. 395–404. [Google Scholar]
- Yu, J.; Rui, Y.; Tao, D. Click prediction for web image reranking using multimodal sparse coding. IEEE Trans. Image Proc. 2014, 23, 2019–2032. [Google Scholar]
- Sharma, A.; Kumar, A.; Daume, H.; Jacobs, D.W. Generalized multiview analysis: A discriminative latent space. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 2160–2167. [Google Scholar]
- Bian, J.; Yang, Y.; Zhang, H.; Chua, T.S. Multimedia summarization for social events in microblog stream. IEEE Trans. Multimed. 2014, 17, 216–228. [Google Scholar] [CrossRef]
- Cai, H.; Yang, Y.; Li, X.; Zi, H. What are popular: Exploring twitter features for event detection, tracking and visualization. In Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 89–98. [Google Scholar]
- Qian, S.; Zhang, T.; Xu, C.; Hossain, M.S. Social event classification via boosted multimodal supervised latent dirichlet allocation. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 2015, 11, 1–22. [Google Scholar] [CrossRef]
- Qian, S.; Zhang, T.; Xu, C. Multi-modal supervised latent dirichlet allocation for event classification in social media. In Proceedings of the International Conference on Internet Multimedia Computing and Service, Xiamen, China, 10–12 July 2014; pp. 152–157. [Google Scholar]
- Qian, S.; Zhang, T.; Xu, C.; Shao, J. Multi-modal event topic model for social event analysis. IEEE Trans. Multimed. 2015, 18, 233–246. [Google Scholar] [CrossRef]
- Chang, X.; Ma, Z.; Yang, Y.; Zeng, Z.; Hauptmann, A.G. Bi-Level Semantic Representation Analysis for Multimedia Event Detection. IEEE Trans. Cybern. 2017, 47, 1180–1197. [Google Scholar] [CrossRef] [PubMed]
- Lv, J.; Liang, J.; Yang, Z. HGE2MED: Heterogeneous Graph Embedding for Multi-domain Event Detection. In Proceedings of the 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), Baltimore, MD, USA, 9–11 November 2020; pp. 1036–1043. [Google Scholar]
- Abebe, M.A.; Tekli, J.; Getahun, F.; Chbeir, R.; Tekli, G. Generic metadata representation framework for social-based event detection, description, and linkage. Knowl.-Based Syst. 2020, 188, 104817. [Google Scholar] [CrossRef]
- Yang, Z.; Li, Q.; Liu, W.; Lv, J. Shared multi-view data representation for multi-domain event detection. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42, 1243–1256. [Google Scholar] [CrossRef] [PubMed]
- Huang, P.Y.; Liang, J.; Lamare, J.B.; Hauptmann, A.G. Multimodal filtering of social media for temporal monitoring and event analysis. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, Yokohama, Japan, 11–14 June 2018; pp. 450–457. [Google Scholar]
- Metzler, D.; Cai, C.; Hovy, E. Structured event retrieval over microblog archives. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Montreal, QC, Canada, 3–8 June 2012; pp. 646–655. [Google Scholar]
- Li, W.; Joo, J.; Hang, Q.; Zhu, S.C. Joint Image-Text News Topic Detection and Tracking by Multimodal Topic And-Or Graph. IEEE Trans. Multimed. 2017, 19, 367–381. [Google Scholar] [CrossRef]
- Neo, S.Y.; Ran, Y.; Goh, H.K.; Zheng, Y.; Li, J. The use of topic evolution to help users browse and find answers in news video corpus. In Proceedings of the 15th ACM International Conference on Multimedia, Augsburg, Germany, 25–29 September 2007; pp. 198–207. [Google Scholar]
- Tan, S.; Ngo, C.W.; Tan, H.K.; Pang, L. Cross media hyperlinking for search topic browsing. In Proceedings of the 19th ACM International Conference on Multimedia, Scottsdale, AZ, USA, 28 November–1 December 2011; pp. 243–252. [Google Scholar]
- Wu, X.; Lu, Y.J.; Peng, Q.; Ngo, C.W. Mining event structures from web videos. IEEE MultiMedia 2011, 18, 38–51. [Google Scholar]
- Wang, D.; Li, T.; Ogihara, M. Generating Pictorial Storylines via Minimum-Weight Connected Dominating Set Approximation in Multi-View Graphs. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, Toronto, ON, Canada, 22–26 July 2012; pp. 683–689. Available online: https://www.aaai.org/ocs/index.php/AAAI/AAAI12/paper/view/5074/5257 (accessed on 1 November 2021).
- Shan, D.; Zhao, W.X.; Chen, R.; Shu, B.; Li, X. Eventsearch: A system for event discovery and retrieval on multi-type historical data. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; pp. 1564–1567. [Google Scholar]
- Xu, S.; Wang, S.; Zhang, Y. Summarizing complex events: A cross-modal solution of storylines extraction and reconstruction. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA, 18–21 October 2013; pp. 1281–1291. [Google Scholar]
- Qian, S.; Zhang, T.; Xu, C. Online multimodal multiexpert learning for social event tracking. IEEE Trans. Multimed. 2018, 20, 2733–2748. [Google Scholar] [CrossRef]
- Papadopoulos, S.; Troncy, R.; Mezaris, V.; Huet, B.; Kompatsiaris, I. Social Event Detection at MediaEval 2011: Challenges, Dataset and Evaluation. In Proceedings of the MediaEval 2011, Pisa, Italy, 1–2 September 2011; Available online: http://mklab.iti.gr/project/sed2012 (accessed on 1 November 2021).
- Papadopoulos, S.; Schinas, E.; Mezaris, V.; Troncy, R.; Kompatsiaris, I. The 2012 social event detection dataset. In Proceedings of the 4th ACM Multimedia Systems Conference, Oslo, Norway, 26 February–1 March 2013; pp. 102–107. [Google Scholar]
- Reuter, T.; Papadopoulos, S.; Petkos, G.; Cimiano, P.; Vries, C.D.; Geva, S. Social event detection at mediaeval 2013: Challenges, datasets, and evaluation. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, Barcelona, Spain, 18–19 October 2013. [Google Scholar]
- Petkos, G.; Papadopoulos, S.; Mezaris, V.; Kompatsiaris, Y. Social Event Detection at MediaEval 2014: Challenges, Datasets, and Evaluation. In Proceedings of the MediaEval 2014, Barcelona, Spain, 16–17 October 2014; Available online: http://ceur-ws.org/Vol-1263/mediaeval2014_submission_5.pdf (accessed on 1 November 2021).
- Qian, S.; Zhang, T.; Xu, C. Boosted multi-modal supervised latent Dirichlet allocation for social event classification. In Proceedings of the 2014 22nd International Conference on Pattern Recognition, Stockholm, Sweden, 24–28 August 2014; pp. 1999–2004. [Google Scholar]
- Xue, F.; Hong, R.; He, X.; Wang, J.; Qian, S.; Xu, C. Knowledge-Based Topic Model for Multi-Modal Social Event Analysis. IEEE Trans. Multimed. 2019, 22, 2098–2110. [Google Scholar] [CrossRef]
- Brenner, M.; Izquierdo, E. MediaEval Benchmark: Social Event Detection in Collaborative Photo Collections. In Proceedings of the MediaEval, Pisa, Italy, 1–2 September 2011; Available online: http://ceur-ws.org/Vol-807/Brenner_SED_QMUL_me11wn.pdf (accessed on 1 November 2021).
- Papadopoulos, S.; Zigkolis, C.; Kompatsiaris, Y.; Telematics, I.; Vakali, A. CERTH@ MediaEval 2011 Social Event Detection Task. In Proceedings of the MediaEval, Pisa, Italy, 1–2 September 2011; Available online: https://oswinds.csd.auth.gr/sites/default/files/publications/pdf/C84.pdf (accessed on 1 November 2021).
- Liu, X.; Huet, B.; Troncy, R. EURECOM@ MediaEval 2011 Social Event Detection Task. In Proceedings of the MediaEval, Pisa, Italy, 1–2 September 2011; Available online: https://www.eurecom.fr/~troncy/Publications/Troncy-sed11.pdf (accessed on 1 November 2021).
- Wang, Y.; Xie, L.; Sundaram, H. Social Event Detection with Clustering and Filtering. In Proceedings of the MediaEval, Pisa, Italy, 4–5 October 2011; Available online: http://ceur-ws.org/Vol-807/Wang_SED_ANU_me11wn.pdf (accessed on 1 November 2021).
- Zeppelzauer, M.; Zaharieva, M.; Breiteneder, C. A Generic Approach for Social Event Detection in Large Photo Collections. In Proceedings of the MediaEval, Pisa, Italy, 4–5 October 2012; Available online: http://ceur-ws.org/Vol-927/mediaeval2012_submission_31.pdf (accessed on 1 November 2021).
- Vavliakis, K.N.; Tzima, F.A.; Mitkas, P.A. Event Detection via LDA for the MediaEval2012 SED Task. In Proceedings of the MediaEval, Pisa, Italy, 4–5 October 2012; Available online: http://ceur-ws.org/Vol-927/mediaeval2012_submis (accessed on 1 November 2021).
- Schinas, E.; Petkos, G.; Papadopoulos, S.; Kompatsiaris, Y. CERTH@ MediaEval 2012 Social Event Detection Task. In Proceedings of the MediaEval, Pisa, Italy, 4–5 October 2012; Available online: http://ceur-ws.org/Vol-927/mediaeval2012_submission_40.pdf (accessed on 1 November 2021).
- Dao, M.S.; Boato, G.; De Natale, F.G.B.; Nguyen, T.V. The Watershed-Based Social Events Detection Method with Support from External Data Sources. In Proceedings of the MediaEval, Pisa, Italy, 4–5 October 2012; Available online: http://citeseerx.ist.psu.edu/viewdoc/download (accessed on 1 November 2021).
- Rafailidis, D.; Semertzidis, T.; Lazaridis, M.; Strintzis, M.G.; Daras, P. A Data-Driven Approach for Social Event Detection. In Proceedings of the MediaEval, Barcelona, Spain, 18–19 October 2013; Available online: http://ceur-ws.org/Vol-1043/mediaeval2013_submission_8.pdf (accessed on 1 November 2021).
- Samangooei, S.; Hare, J.; Dupplaw, D.; Niranjan, M.; Preston, J. Social Event Detection via Sparse Multi-Modal Feature Selection and Incremental Density Based Clustering. In Proceedings of the MediaEval, Barcelona, Spain, 18–19 October 2013; Available online: http://ceur-ws.org/Vol-1043/mediaeval2013_submission_16.pdf (accessed on 1 November 2021).
- Manchon Vizuete, D.; Giró Nieto, X. Upc at Mediaeval 2013 Social Event Detection Task. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. CEUR Workshop Proceedings, Barcelona, Spain, 16–17 October 2014; Available online: http://ceur-ws.org/Vol-1263/mediaeval2014_submission_58.pdf (accessed on 1 November 2021).
- Nguyen, T.V.; Dao, M.S.; Mattivi, R.; Sansone, E.; Natale, F.G.B.D.; Boato, G. Event Clustering and Classification from Social Media: Watershed-Based and Kernel Methods; MediaEval: Barcelona, Spain, 18–19 October 2013; Available online: http://ceur-ws.org/Vol-1043/mediaeval2013_submission_36.pdf (accessed on 1 November 2021).
- Wistuba, M.; Schmidt-Thieme, L. Supervised Clustering of Social Media Streams; MediaEval: Barcelona, Spain, 18–19 October 2013; Available online: http://ceur-ws.org/Vol-1043/mediaeval2013_submission_53.pdf (accessed on 1 November 2021).
- Brenner, M.; Izquierdo, E. MediaEval 2013: Social Event Detection, Retrieval and Classification in Collaborative Photo Collections. MediaEval Barcelona, Spain. 2013, 1043. Available online: http://ceur-ws.org/Vol-1043/mediaeval2013_submission_64.pdf (accessed on 1 November 2021).
- Yang, Z.; Li, Q.; Liu, W.; Ma, Y.; Cheng, M. Dual graph regularized NMF model for social event detection from Flickr data. World Wide Web 2017, 20, 995–1015. [Google Scholar] [CrossRef]
- Yang, Z.; Li, Q.; Lu, Z.; Ma, Y.; Gong, Z.; Liu, W. Dual structure constrained multimodal feature coding for social event detection from flickr data. ACM Trans. Internet Technol. (TOIT) 2017, 17, 1–20. [Google Scholar] [CrossRef]
- Zaharieva, M.; Schopfhauser, D.; Del Fabro, M.; Zeppelzauer, M. Clustering and Retrieval of Social Events in Flickr; MediaEval: Barcelona, Spain, 16–17 October 2014; Available online: http://ceur-ws.org/Vol-1263/mediaeval2014_submission_39.pdf (accessed on 1 November 2021).
- Bao, Y.; Collier, N.; Datta, A. A partially supervised cross-collection topic model for cross-domain text classification. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management, San Francisco, CA, USA, 27 October–1 November 2013; pp. 239–248. [Google Scholar]
- Sang, J.; Xu, C. Right buddy makes the difference: An early exploration of social relation analysis in multimedia applications. In Proceedings of the 20th ACM international conference on Multimedia, Nara, Japan, 29 October–2 November 2012; pp. 19–28. [Google Scholar]
Reference | Detection Technique | Representation |
---|---|---|
[46] | SVM classification | SIFT descriptors, color descriptor |
[48] | Hidden Markov Model | (1) Global temporal features: time of day, day of week, month; (2) Low-level visual features: densely sampled SURF descriptors; (3) Higher-level visual features: the type of scene, type of indoor scene, number of faces, facial attributes over detected faces. |
[49] | Classification | Saliency, Gist and Time. |
[50] | Suffix Tree | Textual annotations, time and geographical positions. |
[51] | Hybrid image clustering | Visual features (scale invariant feature transform (SIFT)) and their tags |
[52] | Clustering | Temporal and content (low frequency DCT features) |
[53] | Coarse classification | Multiple features including time, objects (CNN features) and scenes (CNN feature) |
[54] | Hierarchical model | Representations based on attention network (including image, attribute, scene and time) |
[55] | Clustering | Visual information (bag-of-features with densely-sampled SURF local features and 64-dim RGB color histograms) as well as textual information (event keyword detection) |
[56] | Clustering | Visual dict (used PHOW features to construct a bag-of-visual-words model from the selected image set), text (type, location, time) |
[57] | Clustering | Combinations of photos visual (SIFT) and semantic features, and the photo proprieties |
Category | Reference | Representation | |
---|---|---|---|
Low-level feature | Static frame based visual features | [58] | Various static features including Scale-Invariant Feature Transform (SIFT) |
[60] | Three image features including SIFT, ColorSIFT and Transformed Color Histogram (TCH) | ||
Motion visual features | [61] | Motion SIFT (MoSIFT) | |
[64] | Spatio-temporal interest points (STIP), Motion SIFT and Dense Trajectories | ||
[58] | Various features including 3D Histogram of Oriented Gradients (HoG3D) | ||
[66] | Gradient local ternary pattern, histogram of oriented gradients and Tamura features | ||
High level representation | [67] | Semantic model vectors which is an intermediate level semantic representation extracted using a set of discriminative semantic classifiers. | |
[68] | Decomposed video to a sequence of shots and trained visual concept detectors are used to represent video content with model vector sequences. | ||
[69] | Static features (i.e., SIFT) and dynamic features (i.e., STIP and Dense Trajectory Based features) | ||
Deep learning based methods | [70] | Deep convolutional neural networks (CNN) | |
[71] | Deep convolutional neural networks (CNN) | ||
[72] | Deep convolutional neural networks (CNN) | ||
[73] | Deep convolutional neural networks (CNN) | ||
[74] | Deep convolutional neural networks (CNN) | ||
[75] | Enhanced Ensemble Deep Learning | ||
[76] | Three-dimensional convolutional layers | ||
[77] | Two-stage neural network strategy |
Category | Reference | Advantages | Disadvantages |
---|---|---|---|
Feature fusion based methods | [79] | Utilize various features | Lack of social network information |
[80] | Constrained clustering algorithm is used to achieved high accuracy | Imbalanced data and parameter setting is not optimal | |
[81] | Automatic concept mining and boosted concept learning | Application is limited | |
[82] | Unsupervised and without predefined threshold | Lack of more types of semantic features | |
[83] | Need no manual annotation and can adapt concepts to news domains | - | |
[84] | Utilized the information contained in the related exemplars | - | |
Matrix factorization based methods | [85] | Robust to data incompleteness | - |
[86] | Can discover the shared structure between the datasets | - | |
[90] | semi-supervised co-clustering with side information | Parameters setting is not automatic | |
[91] | Contain dictionary atoms that are semantically discriminative | - | |
[92] | Predict image clicks and solved the problem of lack of data | - | |
Topic model based methods | [94] | Generate visualized summaries | Lack of personalized microblog summarization |
[96] | exploit the multimodality and suitable for large-scale data | Without videos and audios modality | |
[97] | Exploit various property jointly and classify multi-class events | - | |
[98] | Can classify the visual-representative topics from non-visual-representative topics | Didn’t consider different domains | |
Deep learning based methods | [99] | restrains the negative influence of noisy or irrelevant concepts | - |
[100] | Maintain multi-view information by robust representation | - | |
[101] | A generic model to describe events and their relationships | - | |
Other Methods | [102] | Deal with multi-view tasks | Poor data quality and high complexity |
[103] | Jointly regularizing the encoded representations | Lack of event summarization and itemization | |
[104] | Retrieval structured event representation, robust and effective | - | |
[105] | Structured topic representation | - |
Models | Modality Data Type | Characteristic |
---|---|---|
[107] | News video, web news articles and news blogs | Topic evolution based on various online platforms and based on the interest of users. |
[108] | Web video, news article | Topic evolution with various media information, and enables the visualization and exploration of different information. |
[109] | Web video | A concise structure that shows the evolution of events associated with the representative text keywords and visual shots. |
[110] | Images, text | Text and image are combined to analysis and deliver the storyline and structural information to provide topic evolution sketch. |
[111] | Web news, newspaper, TV program, Weibo | Events in a very long time period are detected and traced, and a multimedia-based visualization is provided. |
[112] | Web news with image | Storyline extraction and reconstruction, a unified algorithm is designed to extract all effective storylines. |
[99] | Web news, image | An incremental learning strategy is adopting informative textual and visual topics of social events over time to help understand events and their evolutionary trends. |
[106] | News Text, image | Tracking topics detected in certain continuous time periods. Link all detected topics in different time periods to form topic trajectories over time. |
[113] | Web news, images | Nonparametric online multimodal tracking module that allows the tracked model to evolve backwards to undo undesirable model updates, which helps alleviate the model drift problem of social event tracking. |
Data | Time | Introduction |
---|---|---|
SED2011 | 2011 | Contains 73,645 images of five cities: Amsterdam, Barcelona, London, Paris and Rome. |
SED2012 | 2012 | Contains 167,332 images of five cities: Barcelona, Madrid, Cologne, Hamburg and Hannover |
SED2013 | 2013 | Contains 427,370 images and 1327 videos with XML timestamps, geographic information, tags, titles, descriptions, etc. |
SED2014 | 2014 | SED 2014 released two datasets. One dataset contains 362,578 images and the other dataset contains 110,541 images. The metadata of the two datasets are available. |
Dataset Name | Source | Number of Event Types | Content | Documents Number of Each Event | Total Documents Number | Year |
---|---|---|---|---|---|---|
MMSE (version 1) | Flickr | 10 | image and text | 2500~5000 | 36,000 | 2014 |
MMSE (version 2) | Flickr | 12 | image and text | 3000~6000 | 59,500 | 2014 |
MMSE (version 3) | Flickr | 18 | image and text | 4000~8000 | 107,600 | 2015 |
HFUT-mmdata | Flickr | 10 | image and text | 7000~9000 | 74,364 | 2019 |
Challenge 1 | Challenge 2 | Challenge 3 | |||||
---|---|---|---|---|---|---|---|
F-Score | NMI | F-Score | NMI | F-Score | NMI | ||
SED2011 | [119] | 0.69 | 0.41 | 0.33 | 0.54 | ||
[120] | 0.77 | 0.63 | 0.64 | 0.38 | |||
[121] | 0.59 | 0.27 | 0.69 | 0.62 | |||
[122] | 0.65 | 0.24 | 0.50 | 0.45 | |||
SED2012 | [123] | 0.22 | 0.02 | 0.30 | 0.20 | 0.48 | 0.31 |
[124] | 0.85 | 0.72 | 0.91 | 0.85 | 0.90 | 0.74 | |
[125] | 0.19 | 0.18 | 0.75 | 0.67 | 0.67 | 0.47 | |
[126] | 0.70 | 0.60 | NA | NA | 0.61 | 0.45 | |
SED2013 | [127] | 0.57 | 0.87 | ||||
[128] | 0.95 | 0.99 | |||||
[129] | 0.88 | 0.97 | |||||
[130] | 0.93 | 0.98 | |||||
[131] | 0.88 | 0.97 | |||||
[132] | 0.78 | 0.94 | |||||
SED2014 | [133] | NA | 0.98 | ||||
[134] | 0.97 | 0.99 | |||||
[135] | 0.94 | 0.98 |
SLAD (Visual) | SLDA (Text) | mmLDA + SG | mmLDA + SVM | mm- SLDA | BMM- SLDA | KGE- MMSLDA | |
---|---|---|---|---|---|---|---|
MMSE (version 1) | 0.359 | 0.758 | 0.699 | 0.755 | 0.803 | NA | |
MMSE (version 2) | 0.401 | 0.717 | 0.671 | 0.715 | 0.766 | 0.877 | |
MMSE (version 3) | 0.312 | 0.702 | 0.665 | 0.724 | 0.722 | 0.835 | |
HFUT-mmdata | 0.763 | 0.851 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xiao, K.; Qian, Z.; Qin, B. A Survey of Data Representation for Multi-Modality Event Detection and Evolution. Appl. Sci. 2022, 12, 2204. https://doi.org/10.3390/app12042204
Xiao K, Qian Z, Qin B. A Survey of Data Representation for Multi-Modality Event Detection and Evolution. Applied Sciences. 2022; 12(4):2204. https://doi.org/10.3390/app12042204
Chicago/Turabian StyleXiao, Kejing, Zhaopeng Qian, and Biao Qin. 2022. "A Survey of Data Representation for Multi-Modality Event Detection and Evolution" Applied Sciences 12, no. 4: 2204. https://doi.org/10.3390/app12042204
APA StyleXiao, K., Qian, Z., & Qin, B. (2022). A Survey of Data Representation for Multi-Modality Event Detection and Evolution. Applied Sciences, 12(4), 2204. https://doi.org/10.3390/app12042204