Next Article in Journal / Special Issue
The Effects of Tabular-Based Content Extraction on Patent Document Clustering
Previous Article in Journal
Forecasting the Unit Cost of a Product with Some Linear Fuzzy Collaborative Forecasting Models
Previous Article in Special Issue
Monitoring Threshold Functions over Distributed Data Streams with Node Dependent Constraints
Open AccessArticle

Contextual Anomaly Detection in Text Data

Department of Computer Science, University of Minnesota, 200 Union St SE, Minneapolis 55455, USA
*
Author to whom correspondence should be addressed.
Algorithms 2012, 5(4), 469-489; https://doi.org/10.3390/a5040469
Received: 20 June 2012 / Revised: 10 October 2012 / Accepted: 11 October 2012 / Published: 19 October 2012
We propose using side information to further inform anomaly detection algorithms of the semantic context of the text data they are analyzing, thereby considering both divergence from the statistical pattern seen in particular datasets and divergence seen from more general semantic expectations. Computational experiments show that our algorithm performs as expected on data that reflect real-world events with contextual ambiguity, while replicating conventional clustering on data that are either too specialized or generic to result in contextual information being actionable. These results suggest that our algorithm could potentially reduce false positive rates in existing anomaly detection systems. View Full-Text
Keywords: context detection; topic modeling; anomaly detection context detection; topic modeling; anomaly detection
Show Figures

Figure 1

MDPI and ACS Style

Mahapatra, A.; Srivastava, N.; Srivastava, J. Contextual Anomaly Detection in Text Data. Algorithms 2012, 5, 469-489.

Show more citation formats Show less citations formats

Article Access Map

1
Only visits after 24 November 2015 are recorded.
Back to TopTop