Proactive and automatic detection of product misclassifications at massive scale

Ling Jiang; Xiaoyu Chu; Saaransh Gulati; Pulkit Garg; Andrew Borthwick; Gang Luo

Publication

Proactive and automatic detection of product misclassifications at massive scale

By Ling Jiang, Xiaoyu Chu, Saaransh Gulati, Pulkit Garg, Andrew Borthwick, Gang Luo

2023

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

In e-commerce, product classification is widely used for various purposes. Misclassifying products can cause compliance issues and hurt the company’s reputation. To address this problem, we propose an automated system to proactively detect product misclassifications by overcoming several challenges. A large ecommerce retailer can sell billions of distinct products, on which many thousands of classification tasks are performed. At this massive scale, we need to quickly detect misclassifications under a limited budget. In this talk, we point out these challenges and show how we design our system to handle them. When evaluated on a set of Amazon’s product classification data, at an overhead of <10% of the classification cost, our system automatically identified and corrected many misclassifications, which would take a human many thousand years to manually find and 14.6 years to manually review and correct if our system were not used.

Proactive and automatic detection of product misclassifications at massive scale

Latest news

Work with us