Abstract
We consider the problem of capturing correlations and finding hidden variables corresponding to trends on collections of time series streams. Our proposed method, SPIRIT, can incrementally find correlations and hidden variables, which summarise the key trends in the entire stream collection. It can do this quickly, with no buffering of stream values and without comparing pairs of streams. Moreover, it is any-time, single pass, and it dynamically detects changes. The discovered trends can also be used to immediately spot potential anomalies, to do efficient forecasting and, more generally, to dramatically simplify further data processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aggarwal, Charu C. Han, Jiawei, and Yu, Philip S. (2003). A framework for clustering evolving data streams. In VLDB.
Ali, M.H. Mokbel, Mohamed F. Aref, Walid, and Kamel, Ibrahim (2005). Detection and tracking of discrete phenomena in sensor network databases. In SSDBM.
Brockwell, Peter J. and Davis, Richard A. (1991). Time Series: Theory and Methods. Springer Series in Statistics. Springer-Verlag, 2nd edition.
Deligiannakis, Antonios, Kotidis, Yiannis, and Roussopoulos, Nick (2004). Compressing historical information in sensor networks. In SIGMOD.
Diamantaras, Kostas I. and Kung, Sun-Yuan (1996). Principal Component Neural Networks: Theory and Applications. John Wiley.
Domingos, Pedro and Hulten, Geoff (2000). Mining high-speed data streams. In KDD.
Fukunaga, Keinosuke (1990). Introduction to Statistical Pattern Recognition. Academic Press.
Ganti, Venkatesh, Gehrke, Johannes, and Ramakrishnan, Raghu (2002). Mining data streams under block evolution. SIGKDD Explorations, 3(2): 1–10.
Guha, Sudipto, Gunopulos, Dimitrios, and Koudas, Nick (2003a). Correlating synchronous and asynchronous data streams. In KDD.
Guha, Sudipto, Meyerson, Adam, Mishra, Nina, Motwani, Rajeev, and O’Callaghan, Liadan (2003b). Clustering data streams: Theory and practice. IEEE TKDE, 15(3):515–528.
Haykin, Simon (1992). Adaptive Filter Theory. Prentice Hall.
Hulten, Geoff, Spencer, Laurie, and Domingos, Pedro (2001). Mining time-changing data streams. In KDD.
Jolliffe, I.T. (2002). Principal Component Analysis. Springer.
Kailath, Thomas (1980). Linear Systems. Prentice Hall.
Keogh, Eamonn, Lonardi, Stefano, and Ratanamahatana, Chotirat Ann (2004). Towards parameter-free data mining. In KDD.
Lin, Jessica, Vlachos, Michail, Keogh, Eamonn, and Gunopulos, Dimitrios (2004). Iterative incremental clustering of time series. In EDBT.
Oja, Erkki (1989). Neural networks, principal components, and subspaces. Intl. J. Neural Syst., 1:61–68.
Palpanas, Themistoklis, Vlachos, Michail, Keogh, Eamonn, Gunopulos, Dimitrios, and Truppel, Wagner (2004). Online amnesic approximation of streaming time series. In ICDE.
Papadimitriou, Spiros, Brockwell, Anthony, and Faloutsos, Christos (2003). Adaptive, hands-off stream mining. In VLDB.
Sakurai, Yasushi, Papadimitriou, Spiros, and Faloutsos, Christos (2005). BRAID: Stream mining through group lag correlations. In SIGMOD.
Sun, Jimeng, Papadimitriou, Spiros, and Faloutsos, Christos (2005). Online latent variable detection in sensor networks. In ICDE. (demo).
Wang, Haixun, Fan, Wei, Yu, Philip S., and Han, Jiawei (2003). Mining concept-drifting data streams using ensemble classifiers. In KDD.
Yang, Bin (1995). Projection approximation subspace tracking. IEEE Trans. Sig. Proc., 43(1):95–107.
Yi, Byoung-Kee, Sidiropoulos, N.D. Johnson, Theodore, Jagadish, H.V. Faloutsos, Christos, and Biliris, Alexandras (2000). Online data mining for co-evolving time sequences. In ICDE.
Young, Peter (1984). Recursive Estimation and Time-Series Analysis: An Introduction. Springer-Verlag.
Zhang, Tian, Ramakrishnan, Raghu, and Livny, Miron (1996). BIRCH: An efficient data clustering method for very large databases. In SIGMOD.
Zhu, Yunyue and Shasha, Dennis (2002). StatStream: Statistical monitoring of thousands of data streams in real time. In VLDB.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Papadimitriou, S., Sun, J., Faloutsos, C. (2007). Dimensionality Reduction and Forecasting on Streams. In: Aggarwal, C.C. (eds) Data Streams. Advances in Database Systems, vol 31. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-47534-9_12
Download citation
DOI: https://doi.org/10.1007/978-0-387-47534-9_12
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-28759-1
Online ISBN: 978-0-387-47534-9
eBook Packages: Computer ScienceComputer Science (R0)