Abstract
Product titles on e-commerce marketplaces often suffer from verbosity and inaccuracy, hindering effective communication of essential product details to customers. Refining titles to be more concise and informative is crucial for better user experience and product promotion. Recent solutions to product title refinement follow the standard text extractive and generative methods. Some also leverage multimodal information, e.g. using product images to supplement original titles with visual knowledge. However, these generative methods often produce additional terms not endorsed by sellers. Thus, it remains challenging to incorporate visual information missing from original titles into refined titles without excessively introducing novel terms. Additionally, most existing methods require human-labeled datasets, which are laborious to construct. In response to the two challenges, we present a self-supervised multimodal framework (HLATR) for title refinement that comprises two key modules: (1) a perturbated sample generator that constructs training data by systematically mining homogeneous listing information and (2) a title refinement network that effectively harnesses visual information to refine the original titles. To explicitly balance the extraction from original titles and the generation of supplementary novel terms, we adapt the copy mechanism that is guided by a focused refinement loss. Extensive experiments demonstrate that our proposed framework consistently outperforms others in generating refined titles that contain essential multimodal semantics with minimal deviation from the original ones. Copyright © 2024 held by the owner/author(s).
Original language | English |
---|---|
Title of host publication | Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024 |
Place of Publication | USA |
Publisher | Association for Computing Machinery |
Pages | 2870-2874 |
ISBN (Electronic) | 9798400704314 |
DOIs | |
Publication status | Published - Jul 2024 |
Citation
Deng, J., Shi, K., Huo, H., Wang, D., & Xu, G. (2024). Homogeneous-listing-augmented self-supervised multimodal product title refinement. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024 (pp. 2870-2874). Association for Computing Machinery. https://doi.org/10.1145/3626772.3661347Keywords
- Product title refinement
- Self-supervised learning
- Multimodal generative models