Mining frequent patterns in print logs with semantically alternative labels

Xin LI, Lei ZHANG, Enhong CHEN, Yu ZONG, Guandong XU

Research output: Chapter in Book/Report/Conference proceedingChapters

1 Citation (Scopus)

Abstract

It is common today for users to print the informative information from webpages due to the popularity of printers and internet. Thus, many web printing tools such as Smart Print and PrintUI are developed for online printing. In order to improve the users' printing experience, the interaction data between users and these tools are collected to form a so-called print log data, where each record is the set of urls selected for printing by a user within a certain period of time. Apparently, mining frequent patterns from these print log data can capture user intentions for other applications, such as printing recommendation and behavior targeting. However, mining frequent patterns by directly using url as item representation in print log data faces two challenges: data sparsity and pattern interpretability. To tackle these challenges, we attempt to leverage delicious api (a social bookmarking web service) as an external thesaurus to expand the semantics of each url by selecting tags associated with the domain of each url. In this setting, the frequent pattern mining is employed on the tag representation of each url rather than the url or domain representation. With the enhancement of semantically alternative tag representation, the semantics of url is substantially improved, thus yielding the useful frequent patterns. To this end, in this paper we propose a novel pattern mining problem, namely mining frequent patterns with semantically alternative labels, and propose an efficient algorithm named PaSAL (Frequent Patterns with Semantically Alternative Labels Mining Algorithm) for this problem. Specifically, we propose a new constraint named conflict matrix to purify the redundant patterns to achieve a high efficiency. Finally, we evaluate the proposed algorithm on a real print log data. Copyright © 2013 Springer-Verlag Berlin Heidelberg.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications: 9th International Conference, ADMA 2013, proceedings, part II
EditorsHiroshi MOTODA, Zhaohui WU, Longbing CAO, Osmar ZAIANE, Min YAO, Wei WANG
PublisherSpringer
Pages107-119
ISBN (Electronic)9783642539176
ISBN (Print)9783642539169
DOIs
Publication statusPublished - Dec 2013

Citation

Li, X., Zhang, L., Chen, E., Zong, Y., & Xu, G. (2013). Mining frequent patterns in print logs with semantically alternative labels. In H. Motoda, Z. Wu, L. Cao, O. Zaiane, M. Yao, & W. Wang (Eds.), Advanced Data Mining and Applications: 9th International Conference, ADMA 2013, proceedings, part II (pp. 107-119). Springer. https://doi.org/10.1007/978-3-642-53917-6_10

Fingerprint

Dive into the research topics of 'Mining frequent patterns in print logs with semantically alternative labels'. Together they form a unique fingerprint.