Algorithms 2012, 5(4), 469-489; doi:10.3390/a5040469

Contextual Anomaly Detection in Text Data

* email, email and email
Received: 20 June 2012; in revised form: 10 October 2012 / Accepted: 11 October 2012 / Published: 19 October 2012
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract: We propose using side information to further inform anomaly detection algorithms of the semantic context of the text data they are analyzing, thereby considering both divergence from the statistical pattern seen in particular datasets and divergence seen from more general semantic expectations. Computational experiments show that our algorithm performs as expected on data that reflect real-world events with contextual ambiguity, while replicating conventional clustering on data that are either too specialized or generic to result in contextual information being actionable. These results suggest that our algorithm could potentially reduce false positive rates in existing anomaly detection systems.
Keywords: context detection; topic modeling; anomaly detection
PDF Full-text Download PDF Full-Text [3733 KB, uploaded 19 October 2012 08:46 CEST]

Export to BibTeX |

MDPI and ACS Style

Mahapatra, A.; Srivastava, N.; Srivastava, J. Contextual Anomaly Detection in Text Data. Algorithms 2012, 5, 469-489.

AMA Style

Mahapatra A, Srivastava N, Srivastava J. Contextual Anomaly Detection in Text Data. Algorithms. 2012; 5(4):469-489.

Chicago/Turabian Style

Mahapatra, Amogh; Srivastava, Nisheeth; Srivastava, Jaideep. 2012. "Contextual Anomaly Detection in Text Data." Algorithms 5, no. 4: 469-489.

Algorithms EISSN 1999-4893 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert