Homogeneous-listing-augmented self-supervised multimodal product title refinement

Jiaqi DENG, Kaize SHI, Huan HUO, Dingxian WANG, Guandong XU

Research output: Chapter in Book/Report/Conference proceedingChapters

1 Citation (Scopus)

Abstract

Product titles on e-commerce marketplaces often suffer from verbosity and inaccuracy, hindering effective communication of essential product details to customers. Refining titles to be more concise and informative is crucial for better user experience and product promotion. Recent solutions to product title refinement follow the standard text extractive and generative methods. Some also leverage multimodal information, e.g. using product images to supplement original titles with visual knowledge. However, these generative methods often produce additional terms not endorsed by sellers. Thus, it remains challenging to incorporate visual information missing from original titles into refined titles without excessively introducing novel terms. Additionally, most existing methods require human-labeled datasets, which are laborious to construct. In response to the two challenges, we present a self-supervised multimodal framework (HLATR) for title refinement that comprises two key modules: (1) a perturbated sample generator that constructs training data by systematically mining homogeneous listing information and (2) a title refinement network that effectively harnesses visual information to refine the original titles. To explicitly balance the extraction from original titles and the generation of supplementary novel terms, we adapt the copy mechanism that is guided by a focused refinement loss. Extensive experiments demonstrate that our proposed framework consistently outperforms others in generating refined titles that contain essential multimodal semantics with minimal deviation from the original ones. Copyright © 2024 held by the owner/author(s).

Original languageEnglish
Title of host publicationProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024
Place of PublicationUSA
PublisherAssociation for Computing Machinery
Pages2870-2874
ISBN (Electronic)9798400704314
DOIs
Publication statusPublished - Jul 2024

Citation

Deng, J., Shi, K., Huo, H., Wang, D., & Xu, G. (2024). Homogeneous-listing-augmented self-supervised multimodal product title refinement. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024 (pp. 2870-2874). Association for Computing Machinery. https://doi.org/10.1145/3626772.3661347

Keywords

  • Product title refinement
  • Self-supervised learning
  • Multimodal generative models

Fingerprint

Dive into the research topics of 'Homogeneous-listing-augmented self-supervised multimodal product title refinement'. Together they form a unique fingerprint.