Outlier detection with autoencoder ensembles
Jinghui Chen, Saket Sathe, et al.
SDM 2017
The dramatic rise of time-series data in a variety of contexts, such as social networks, mobile sensing, data centre monitoring, etc., has fuelled interest in obtaining real-time insights from such data using distributed stream processing systems. One such extremely valuable insight is the discovery of correlations in real-time from large-scale time-series data. A key challenge in discovering correlations is that the number of time-series pairs that have to be analyzed grows quadratically in the number of time-series, giving rise to a quadratic increase in both computation cost and communication cost between the cluster nodes in a distributed environment. To tackle the challenge, we propose a framework called AEGIS. AEGIS exploits well-established statistical properties to dramatically prune the number of time-series pairs that have to be evaluated for detecting interesting correlations. Our extensive experimental evaluations on real and synthetic datasets establish the efficacy of AEGIS over baselines.
Jinghui Chen, Saket Sathe, et al.
SDM 2017
Zhixian Yan, Dipanjan Chakraborty, et al.
EDBT 2011
Daniel Lemes Gribel, Maira Gatti de Bayser, et al.
CIKM 2015
Sue Ann Chen, Arun Vishwanath, et al.
ISGT ASIA 2015