We develop an unsupervised learning framework for extracting popular product attributes from different Web product description pages. Unlike existing systems which do not differentiate the popularity of the attributes, we propose a framework which is able not only to detect concerned popular features of a product from a collection of customer reviews, but also to map these popular features to the related product attributes, and at the same time to extract these attributes from description pages. To tackle the technical challenges, we develop a discriminative graphical model based on hidden Conditional Random Fields. We have conducted experiments on several product domains. The empirical results show that our framework is effective. Copyright © 2012 Springer-Verlag Berlin Heidelberg.
|Title of host publication||Information retrieval technology: 8th Asia Information Retrieval Societies Conference, AIRS 2012, Tianjin, China, December 17-19, 2012. Proceedings|
|Editors||Yuexian HOU, Jian-Yun NIE, Le SUN , Bo WANG , Peng ZHANG|
|Place of Publication||Berlin|
|Publisher||Springer Berlin Heidelberg|
|Publication status||Published - 2012|
CitationBing, L., Wong, T.-L., & Lam, W. (2012). Unsupervised extraction of popular product attributes from web sites. In Y. Hou, J.-Y. Nie, L. Sun, B. Wang, P. Zhang (Eds.), Information retrieval technology: 8th Asia Information Retrieval Societies Conference, AIRS 2012, Tianjin, China, December 17-19, 2012. Proceedings (pp. 437-446). Berlin: Springer Berlin Heidelberg.
- Information extraction
- Conditional random fields