Affiliation:
1. University of Nebraska at Omaha, USA
2. University of Albany, SUNY, USA
Abstract
Time series data is usually generated by measuring and monitoring applications, and accounts for a large fraction of the data available for analysis purposes. A time series is typically a sequence of values that represent the state of a variable over time. Each value of the variable might be a simple value, or might have a composite structure, such as a vector of values. Time series data can be collected about natural phenomena, such as the amount of rainfall in a geographical region, or about a human activity, such as the number of shares of GoogleTM stock sold each day. Time series data is typically used for predicting future behavior from historical performance. However, a time series often needs further processing to discover the structure and properties of the recorded variable, thereby facilitating the understanding of past behavior and prediction of future behavior. Segmentation of a given time series is often used to compactly represent the time series (Gionis & Mannila, 2005), to reduce noise, and to serve as a high-level representation of the data (Das, Lin, Mannila, Renganathan & Smyth, 1998; Keogh & Kasetty, 2003). Data mining of a segmentation of a time series, rather than the original time series itself, has been used to facilitate discovering structure in the data, and finding various kinds of information, such as abrupt changes in the model underlying the time series (Duncan & Bryant, 1996; Keogh & Kasetty, 2003), event detection (Guralnik & Srivastava, 1999), etc. The rest of this chapter is organized as follows. The section on Background gives an overview of the time series segmentation problem and solutions. This section is followed by a Main Focus section where details of the tasks involved in segmenting a given time series and a few sample applications are discussed. Then, the Future Trends section presents some of the current research trends in time series segmentation and the Conclusion section concludes the chapter. Several important terms and their definitions are also included at the end of the chapter.
Reference24 articles.
1. Modified Gath–Geva clustering for fuzzy segmentation of multivariate time-series
2. Chundi, P., & Rosenkrantz, D. J. (2004a), Constructing Time Decompositions for Time Stamped Documents, SIAM Fourth International Conference on Data Mining, 57-68.
3. Chundi, P., & Rosenkrantz, D. J. (2004b), On Lossy Time Decompositions of Time Stamped Documents, 2004 ACM CIKM Conference on Information and Knowledge Management, 437-445.
4. An Evolutionary Approach to Pattern-Based Time Series Segmentation
5. Cohen, P. R., Heeringa, B., & Adams, M. N. (2002), Unsupervised Segmentation of Categorical Time Series into Episodes, Proceedings of the 2002 IEEE International Conference on Data Mining, 99-106.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献