Multi-Modal Fashion Product Retrieval

A. Rubio, Long Long Yu, E. Simo-Serra, F. Moreno-Noguer

研究成果: Conference contribution

抄録

Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem. In this paper, we leverage both the images and textual metadata and propose a joint multi-modal embedding that maps both the text and images into a common latent space. Distances in the latent space correspond to similarity between products, allowing us to effectively perform retrieval in this latent space. We compare against existing approaches and show significant improvements in retrieval tasks on a large-scale e-commerce dataset.

本文言語English
ホスト出版物のタイトルVL 2017 - 6th Workshop on Vision and Language, Proceedings of the Workshop
出版社Association for Computational Linguistics (ACL)
ページ43-45
ページ数3
ISBN(電子版)9781945626517
出版ステータスPublished - 2017
イベント6th Workshop on Vision and Language, VL 2017 as part of EACL 2017 - Valencia, Spain
継続期間: 2017 4月 4 → …

出版物シリーズ

名前VL 2017 - 6th Workshop on Vision and Language, Proceedings of the Workshop

Conference

Conference6th Workshop on Vision and Language, VL 2017 as part of EACL 2017
国/地域Spain
CityValencia
Period17/4/4 → …

ASJC Scopus subject areas

  • コンピュータ グラフィックスおよびコンピュータ支援設計
  • コンピュータ ビジョンおよびパターン認識
  • 人間とコンピュータの相互作用
  • 言語学および言語

フィンガープリント

「Multi-Modal Fashion Product Retrieval」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル