DISSparse: Efficient Mining of Discriminative Itemsets
Majid Seyfi, Richi Nayak, Yue Xu and Shlomo Geva
Additional contact information
Majid Seyfi: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Richi Nayak: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Yue Xu: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Shlomo Geva: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Journal of Information & Knowledge Management (JIKM), 2022, vol. 21, issue 01, 1-42
Abstract:
We tackle the problem of discriminative itemset mining. Given a set of datasets, we want to find the itemsets that are frequent in the target dataset and have much higher frequencies there than the same itemsets have in the other datasets. Such itemsets are very useful for dataset discrimination. We demonstrate that this problem has important applications and, at the same time, is very challenging. We present the DISSparse algorithm, a mining method that uses two determinative heuristics based on the sparsity characteristics of the discriminative itemsets, which form a small subset of the frequent itemsets. We prove that the DISSparse algorithm is sound and complete. We experimentally investigate the performance of DISSparse on a range of datasets, evaluating its efficiency and stability and demonstrating that it is substantially faster than the baseline method.
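To make the problem statement concrete, the following is a minimal brute-force sketch of what "frequent in the target dataset and much more frequent than elsewhere" could mean. It is not the DISSparse algorithm, whose heuristics the abstract does not spell out. The function names, the relative-support measure, and the `min_support`, `min_ratio` and `max_size` parameters are all assumptions made for illustration; the paper may define the discriminative criterion differently.

```python
from itertools import combinations


def support(itemset, transactions):
    """Fraction of transactions that contain every item of `itemset`."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def discriminative_itemsets(target, others, min_support=0.3,
                            min_ratio=2.0, max_size=3):
    """Naive enumeration (hypothetical helper, not the paper's method).

    An itemset is reported when (assumed criterion):
      1. its support in `target` is at least `min_support`, and
      2. that support is at least `min_ratio` times its highest
         support in any dataset of `others`.
    Returns {itemset_tuple: (target_support, max_other_support)}.
    """
    target = [frozenset(t) for t in target]
    others = [[frozenset(t) for t in d] for d in others]
    items = sorted(set().union(*target)) if target else []

    result = {}
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            sup_target = support(candidate, target)
            if sup_target < min_support:
                continue  # not frequent in the target dataset
            sup_other = max((support(candidate, d) for d in others),
                            default=0.0)
            if sup_target >= min_ratio * sup_other:
                result[combo] = (sup_target, sup_other)
    return result
```

The sketch enumerates every candidate itemset up to `max_size`, so its cost grows exponentially with the number of items. That cost is the kind of challenge the abstract alludes to, and it is why a dedicated method is needed in practice.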
Keywords: Data mining; discriminative itemsets; prefix tree
Date: 2022
Downloads:
http://www.worldscientific.com/doi/abs/10.1142/S0219649222500095
Access to full text is restricted to subscribers
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:21:y:2022:i:01:n:s0219649222500095
DOI: 10.1142/S0219649222500095
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.