Mining approximate sequential patterns with gaps
Kelly K. Yip and David A. Nembhard
International Journal of Data Mining, Modelling and Management, 2015, vol. 7, issue 2, 108-129
Abstract:
Time series data are found in diverse fields, including science, business, medicine, and engineering. In this paper, we consider sequential pattern mining for categorical time series data that contain multiple independent time series. Frequent patterns are considered important in a variety of applications. However, it is common for data to contain noise and/or for the source process to have considerable variability. Conventional sequential pattern mining methods that use exact matching address some, but not all, of these difficulties. Two general approaches used in previous studies to mine sequential patterns in data with noise are distance-based clustering and hidden Markov models. While these approaches are useful in mining frequent sequential patterns in noisy data, we further propose a framework (MWASP: multiple-width approximate sequential pattern mining) that uncovers frequent approximate sequential patterns with various widths. A mined pattern in this framework is representative of a group of sequences that follow the pattern's event flow order. This gives insight into the occurrence of the pattern longitudinally, as well as across the population. The pattern can be recognised as common across the multiple time series, across time, or both.
Keywords: data mining; hidden Markov model; HMM; sequential pattern search; sequential pattern mining; approximate sequential patterns; gaps; time series data; multiple time series
Date: 2015
Downloads: (external link)
http://www.inderscience.com/link.php?id=69249 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdmmm:v:7:y:2015:i:2:p:108-129
More articles in International Journal of Data Mining, Modelling and Management from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker.