Detecting Unusual Behaviour and Mining Unstructured Data
Alexander Balinsky,
Helen Balinsky and
Steven Simske
Additional contact information
Alexander Balinsky: Cardiff University, Cardiff School of Mathematics
Helen Balinsky: Hewlett-Packard Laboratories
Steven Simske: Hewlett-Packard Laboratories
A chapter in UK Success Stories in Industrial Mathematics, 2016, pp 181-187 from Springer
Abstract:
Keyword and feature extraction is a fundamental problem in data mining and document processing. Most applications depend directly on the quality and speed of keyword and feature extraction pre-processing results. In the current paper we present novel algorithms for feature extraction and change detection in unstructured data, primarily in textual and sequential data. Our approach is based on ideas from image processing and especially on the Helmholtz Principle from the Gestalt Theory of human perception. The improvements due to the novel feature extraction technique are demonstrated on several key applications: classification for strengthening document security and storage optimization, and automatic summarization and segmentation for problems of information overload. The developed algorithms and applications are the result of research collaboration between Cardiff University School of Mathematics and HP Laboratories.
Keywords: Mining Unstructured Data; Helmholtz Principle; Cardiff University School; Gestalt Theory; Extractive Text Summarization
Date: 2016
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-319-25454-8_23
Ordering information: This item can be ordered from
http://www.springer.com/9783319254548
DOI: 10.1007/978-3-319-25454-8_23