Unsupervised extraction of popular product attributes from web sites

Lidong BING, Tak Lam WONG, Wai LAM

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

We develop an unsupervised learning framework for extracting popular product attributes from different Web product description pages. Unlike existing systems which do not differentiate the popularity of the attributes, we propose a framework which is able not only to detect concerned popular features of a product from a collection of customer reviews, but also to map these popular features to the related product attributes, and at the same time to extract these attributes from description pages. To tackle the technical challenges, we develop a discriminative graphical model based on hidden Conditional Random Fields. We have conducted experiments on several product domains. The empirical results show that our framework is effective. Copyright © 2012 Springer-Verlag Berlin Heidelberg.
Original languageEnglish
Title of host publicationInformation retrieval technology: 8th Asia Information Retrieval Societies Conference, AIRS 2012, Tianjin, China, December 17-19, 2012. Proceedings
EditorsYuexian HOU, Jian-Yun NIE, Le SUN , Bo WANG , Peng ZHANG
Place of PublicationBerlin
PublisherSpringer Berlin Heidelberg
Pages437-446
ISBN (Print)9783642353406
DOIs
Publication statusPublished - 2012

Fingerprint

Unsupervised learning
Websites
Experiments

Citation

Bing, L., Wong, T.-L., & Lam, W. (2012). Unsupervised extraction of popular product attributes from web sites. In Y. Hou, J.-Y. Nie, L. Sun, B. Wang, P. Zhang (Eds.), Information retrieval technology: 8th Asia Information Retrieval Societies Conference, AIRS 2012, Tianjin, China, December 17-19, 2012. Proceedings (pp. 437-446). Berlin: Springer Berlin Heidelberg.

Keywords

  • Information extraction
  • Conditional random fields