An evaluation and annotation methodology for product category matching in e-commerce

Bibliographic details
Published in: Computers in Industry, 2021-10, Vol. 131, p. 103497, Article 103497
Main authors: Kejriwal, Mayank; Shen, Ke; Ni, Chien-Chun; Torzec, Nicolas
Format: Article
Language: English
Summary:
• Product category matching is an important task in digital marketplaces and e-commerce.
• This paper motivates, describes and formalizes the problem of product category matching.
• The paper also presents a rigorously designed methodology and guidelines for acquiring reliable and cost-effective annotations for this task.
• The utility of all methods presented is validated on three real-world e-commerce taxonomies.

Product category matching is an important task in digital marketplaces and e-commerce, helping to power better search and recommendations in an online context. While variants of the problem have received some attention in academia, there is no documented guidance on how to efficiently acquire annotations for evaluating multiple (current and future) models, many of which rely on modern machine learning techniques such as neural representation learning. In this paper, we motivate and formalize the problem of product category matching in e-commerce, and present a rigorously designed methodology and set of guidelines for acquiring annotations in a cost-effective and reliable manner. We also present a methodology for using the annotations to compare the solutions of two or more product category matching methods, including comparing models both before and after annotation. Three widely used e-commerce product category taxonomies, and multiple metrics, are used to demonstrate the utility of our proposals.
ISSN: 0166-3615, 1872-6194
DOI: 10.1016/j.compind.2021.103497