Proactive and automatic detection of product misclassifications at massive scale
2023
In e-commerce, product classification is widely used for various purposes. Misclassifying products can cause compliance issues and hurt the company’s reputation. To address this problem, we propose an automated system to proactively detect product misclassifications by overcoming several challenges. A large ecommerce retailer can sell billions of distinct products, on which many thousands of classification tasks are performed. At this massive scale, we need to quickly detect misclassifications under a limited budget. In this talk, we point out these challenges and show how we design our system to handle them. When evaluated on a set of Amazon’s product classification data, at an overhead of <10% of the classification cost, our system automatically identified and corrected many misclassifications, which would take a human many thousand years to manually find and 14.6 years to manually review and correct if our system were not used.
Research areas