APPECT: An approximate backbone-based clustering algorithm for tags

Yu ZONG, Guandong XU, Ping JIN, Yanchun ZHANG, Enhong CHEN, Rong PAN

Research output: Chapter in Book/Report/Conference proceeding › Chapters

3 Citations (Scopus)

Abstract

In social annotation systems, users label digital resources with tags, which are freely chosen textual descriptions. Tags serve as additional metadata for indexing, annotating and retrieving resources. Poor retrieval performance remains a major problem in most social tagging systems because tags are ambiguous, redundant and carry little semantic information. Clustering is a useful tool for addressing these difficulties. Most research on tag clustering directly applies traditional clustering algorithms such as K-means or Hierarchical Agglomerative Clustering to tagging data, and these algorithms have inherent drawbacks, such as sensitivity to initialization. In this paper, we instead use the approximate backbone of tag clustering results to find better tag clusters. In particular, we propose an APProximate backbonE-based Clustering algorithm for Tags (APPECT). The main steps of APPECT are: (1) we execute the K-means algorithm on a tag similarity matrix M times and collect a set of tag clustering results Z = {C¹, C², ..., Cᴹ}; (2) we form the approximate backbone of Z by executing a greedy search; (3) we fix the approximate backbone as the initial tag clustering result and then assign the remaining tags to the corresponding clusters based on similarity. Experimental results on three real-world datasets, namely MedWorm, MovieLens and Dmoz, demonstrate the effectiveness and superiority of the proposed method over the traditional approaches. Copyright © 2011 Springer-Verlag Berlin Heidelberg.
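
The three steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the greedy search, so step (2) here uses an assumed stand-in: tags that share a cluster in every run form a stable group, and the k largest groups are kept as the backbone. Step (3) assumes "based on similarity" means highest mean similarity to a backbone group. All function and parameter names (`kmeans`, `approximate_backbone`, `assign_rest`, `appect_sketch`, `sim`, `k`, `m`) are hypothetical.

```python
import random
from collections import defaultdict


def kmeans(rows, k, rng, iters=20):
    """Plain Lloyd's K-means on row vectors; returns one label per row."""
    centers = [list(rows[i]) for i in rng.sample(range(len(rows)), k)]
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(r, centers[c])))
            for r in rows
        ]
        # Update step: mean of each non-empty cluster (empty ones keep their center).
        for c in range(k):
            members = [rows[i] for i, lab in enumerate(labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def approximate_backbone(runs, k):
    """ASSUMED stand-in for the paper's greedy search: group tags that are
    co-clustered in every run, then keep the k largest groups."""
    groups = defaultdict(list)
    for tag in range(len(runs[0])):
        # Tags with identical label signatures share a cluster in all runs.
        groups[tuple(run[tag] for run in runs)].append(tag)
    return sorted(groups.values(), key=len, reverse=True)[:k]


def assign_rest(sim, backbone):
    """Fix the backbone groups as clusters, then give every remaining tag to
    the group with the highest mean similarity (an assumed criterion)."""
    labels = [-1] * len(sim)
    for c, members in enumerate(backbone):
        for tag in members:
            labels[tag] = c
    for tag in range(len(sim)):
        if labels[tag] == -1:
            labels[tag] = max(
                range(len(backbone)),
                key=lambda c: sum(sim[tag][j] for j in backbone[c]) / len(backbone[c]),
            )
    return labels


def appect_sketch(sim, k, m=10, seed=0):
    rng = random.Random(seed)
    runs = [kmeans(sim, k, rng) for _ in range(m)]  # step (1)
    backbone = approximate_backbone(runs, k)        # step (2), assumed
    return assign_rest(sim, backbone)               # step (3)
```

Calling `appect_sketch(sim, k=2)` on a square tag similarity matrix given as a list of lists returns one cluster label per tag. Because only tags that agree across all runs seed the clusters, a single poorly initialized K-means run splits stable groups rather than dictating the final result, which is the intuition behind fixing a backbone first.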

Original language: English
Title of host publication: Advanced data mining and applications: 7th International Conference, ADMA 2011, Beijing, China, December 17-19, 2011, Proceedings, Part I
Editors: Jie TANG, Jianyong WANG, Irwin KING, Ling CHEN
Publisher: Springer
Pages: 175-189
ISBN (Electronic): 9783642258534
ISBN (Print): 9783642258527
DOIs: https://doi.org/10.1007/978-3-642-25853-4_14
Publication status: Published - 2011

Citation

Zong, Y., Xu, G., Jin, P., Zhang, Y., Chen, E., & Pan, R. (2011). APPECT: An approximate backbone-based clustering algorithm for tags. In J. Tang, J. Wang, I. King, & L. Chen (Eds.), Advanced data mining and applications: 7th International Conference, ADMA 2011, Beijing, China, December 17-19, 2011, Proceedings, Part I (pp. 175-189). Springer. https://doi.org/10.1007/978-3-642-25853-4_14

Keywords

  • Approximate backbone
  • Tag clustering
  • Social annotation systems
