Abstract
We develop an unsupervised learning framework for extracting popular product attributes from product description pages originated from different E-commerce Web sites. Unlike existing information extraction methods that do not consider the popularity of product attributes, our proposed framework is able to not only detect popular product features from a collection of customer reviews but also map these popular features to the related product attributes. One novelty of our framework is that it can bridge the vocabulary gap between the text in product description pages and the text in customer reviews. Technically, we develop a discriminative graphical model based on hidden Conditional Random Fields. As an unsupervised model, our framework can be easily applied to a variety of new domains and Web sites without the need of labeling training samples. Extensive experiments have been conducted to demonstrate the effectiveness and robustness of our framework. Copyright © 2016 ACM.
Original language | English |
---|---|
Article number | 12 |
Journal | ACM Transactions on Internet Technology (TOIT) |
Volume | 16 |
Issue number | 2 |
DOIs | |
Publication status | Published - Apr 2016 |
Citation
Bing, L., Wong, T.-L., & Lam, W. (2016). Unsupervised extraction of popular product attributes from e-commerce web sites by considering customer reviews. ACM Transactions on Internet Technology (TOIT), 16(2), Article No. 12.Keywords
- Information extraction
- Conditional random fields
- Product attribute
- Customer reviews