GraphClust: A Method for Clustering Database of Graphs
Diego Reforgiato (),
Rodrigo Gutierrez () and
Dennis Shasha ()
Additional contact information
Diego Reforgiato: Dipartimento di Matematica e Informatica, Università degli Studi di Catania, Italy
Rodrigo Gutierrez: Biology Department, New York University, USA;
Dennis Shasha: Computer Science Department, New York University, USA
Journal of Information & Knowledge Management (JIKM), 2008, vol. 07, issue 04, 231-241
Abstract:
Any application that represents data as sets of graphs may benefit from the discovery of relationships among those graphs. To do this in an unsupervised fashion requires the ability to find graphs that are similar to one another. That is the purpose of GraphClust. The GraphClust algorithm proceeds in three phases, often building on other tools:(1) it finds highly connected substructures in each graph;(2) it uses those substructures to represent each graph as a feature vector; and(3) it clusters these feature vectors using a standard distance measure. We validate the cluster quality by using the Silhouette method. In addition to clustering graphs, GraphClust uses SVD decomposition to find frequently co-occurring connected substructures. The main novelty of GraphClust compared to previous methods is that it is application-independent and scalable to many large graphs.
Keywords: Text clustering; document vectors; graph clustering; graph substructure matching (search for similar items in EconPapers)
Date: 2008
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649208002093
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:07:y:2008:i:04:n:s0219649208002093
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649208002093
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().