Open AccessThis article is
- freely available
Contextual Anomaly Detection in Text Data
Department of Computer Science, University of Minnesota, 200 Union St SE, Minneapolis 55455, USA
* Author to whom correspondence should be addressed.
Received: 20 June 2012; in revised form: 10 October 2012 / Accepted: 11 October 2012 / Published: 19 October 2012
Abstract: We propose using side information to further inform anomaly detection algorithms of the semantic context of the text data they are analyzing, thereby considering both divergence from the statistical pattern seen in particular datasets and divergence seen from more general semantic expectations. Computational experiments show that our algorithm performs as expected on data that reflect real-world events with contextual ambiguity, while replicating conventional clustering on data that are either too specialized or generic to result in contextual information being actionable. These results suggest that our algorithm could potentially reduce false positive rates in existing anomaly detection systems.
Keywords: context detection; topic modeling; anomaly detection
Article StatisticsClick here to load and display the download statistics.
Notes: Multiple requests from the same IP address are counted as one view.
Cite This Article
MDPI and ACS Style
Mahapatra, A.; Srivastava, N.; Srivastava, J. Contextual Anomaly Detection in Text Data. Algorithms 2012, 5, 469-489.
Mahapatra A, Srivastava N, Srivastava J. Contextual Anomaly Detection in Text Data. Algorithms. 2012; 5(4):469-489.
Mahapatra, Amogh; Srivastava, Nisheeth; Srivastava, Jaideep. 2012. "Contextual Anomaly Detection in Text Data." Algorithms 5, no. 4: 469-489.