DISSparse: Efficient Mining of Discriminative Itemsets

Majid Seyfi, Richi Nayak, Yue Xu and Shlomo Geva
Additional contact information
Majid Seyfi: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Richi Nayak: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Yue Xu: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia
Shlomo Geva: Data Science Discipline, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Queensland 4000, Australia

Journal of Information & Knowledge Management (JIKM), 2022, vol. 21, issue 01, 1-42

Abstract: We tackle the problem of discriminative itemset mining. Given a set of datasets, we want to find the itemsets that are frequent in the target dataset and whose frequencies there are much higher than in the other datasets. Such itemsets are very useful for dataset discrimination. We demonstrate that this problem has important applications and, at the same time, is very challenging. We present the DISSparse algorithm, a mining method that uses two determinative heuristics based on the sparsity characteristics of the discriminative itemsets as a small subset of the frequent itemsets. We prove that the DISSparse algorithm is sound and complete. We experimentally investigate the performance of DISSparse on a range of datasets, evaluating its efficiency and stability and demonstrating that it is substantially faster than the baseline method.
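
To make the problem statement concrete, the following is a minimal brute-force sketch of discriminative itemset mining. It is not the DISSparse algorithm and uses neither of its heuristics nor a prefix tree; it only illustrates the definition given in the abstract. The function names, the `min_support` and `min_ratio` thresholds, the `max_size` cap, and the choice to pool all non-target datasets into one list are assumptions made for illustration and do not come from the article.

```python
from itertools import combinations


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def discriminative_itemsets(target, others, min_support=0.5,
                            min_ratio=2.0, max_size=3):
    """Exhaustively enumerate itemsets (illustrative only, NOT DISSparse).

    An itemset is reported when it is frequent in `target`
    (support >= min_support) and its target support is at least
    `min_ratio` times its support in `others`. All thresholds are
    hypothetical parameters chosen for this sketch.
    """
    target = [frozenset(t) for t in target]
    others = [frozenset(t) for t in others]
    items = sorted(set().union(*target)) if target else []

    results = {}
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            itemset = frozenset(combo)
            s_target = support(itemset, target)
            if s_target < min_support:
                continue  # not frequent in the target dataset
            s_other = support(itemset, others)
            # An itemset absent from the other data is treated as
            # infinitely discriminative (an assumption of this sketch).
            ratio = float("inf") if s_other == 0 else s_target / s_other
            if ratio >= min_ratio:
                results[itemset] = (s_target, s_other)
    return results


# Toy data: four transactions in the target, four in the pooled others.
target = [{"a", "b"}, {"a", "b", "c"}, {"a", "c"}, {"b"}]
others = [{"a"}, {"c"}, {"b", "c"}, {"a", "c"}]
result = discriminative_itemsets(target, others)
for itemset, (s_t, s_o) in sorted(result.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), s_t, s_o)
```

On this toy data the sketch reports {a, b}, {a, c} and {b}. The exhaustive enumeration grows exponentially with the number of items, which gives a sense of why the abstract calls the problem challenging and why pruning heuristics matter.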

Keywords: Data mining; discriminative itemsets; prefix tree
Date: 2022

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649222500095
Access to full text is restricted to subscribers

Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:21:y:2022:i:01:n:s0219649222500095

DOI: 10.1142/S0219649222500095

Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh

More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.

 
Page updated 2025-03-20
Handle: RePEc:wsi:jikmxx:v:21:y:2022:i:01:n:s0219649222500095