KIPTC: A kernel information propagation tag clustering algorithm

Guandong XU, Yu ZONG, Ping JIN, Rong PAN, Zongda WU

Research output: Contribution to journalArticlespeer-review

19 Citations (Scopus)

Abstract

In the social annotation systems, users annotate digital data sources by using tags which are freely chosen textual descriptions. Tags are used to index, annotate and retrieve resource as an additional metadata of resource. Poor retrieval performance remains a major challenge of most social annotation systems resulting from several problems of ambiguity, redundancy and less semantic nature of tags. Clustering is a useful tool to handle these problems in social annotation systems. In this paper, we propose a novel tag clustering algorithm based on kernel information propagation. This approach makes use of the kernel density estimation of the kNN neighborhood directed graph as a start to reveal the prestige rank of tags in tagging data. The random walk with restart algorithm is then employed to determine the center points of tag clusters. The main strength of the proposed approach is the capability of partitioning tags from the perspective of tag prestige rank rather than the intuitive similarity calculation itself. Experimental studies on the six real world data sets demonstrate the effectiveness and superiority of the proposed method against other state-of-the-art clustering approaches in terms of various evaluation metrics. Copyright © 2013 Springer Science+Business Media New York.

Original languageEnglish
Pages (from-to)95-112
JournalJournal of Intelligent Information Systems
Volume45
Early online dateJul 2013
DOIs
Publication statusPublished - Aug 2015

Citation

Xu, G., Zong, Y., Jin, P., Pan, R., & Wu, Z. (2015). KIPTC: A kernel information propagation tag clustering algorithm. Journal of Intelligent Information Systems, 45, 95-112. https://doi.org/10.1007/s10844-013-0262-7

Keywords

  • Social tagging systems
  • Tag clustering
  • Kernel information propagation

Fingerprint

Dive into the research topics of 'KIPTC: A kernel information propagation tag clustering algorithm'. Together they form a unique fingerprint.