Behavior-driven query similarity prediction based on pre-trained language models for e-commerce search

Yupin Huang; Jiri Gesi; Xinyu Hong; Han Cheng; Kai Zhong; Vivek Mittal; Qingjun Cui; Vamsi Salaka

Publication

Behavior-driven query similarity prediction based on pre-trained language models for e-commerce search

By Yupin Huang, Jiri Gesi, Xinyu Hong, Han Cheng, Kai Zhong, Vivek Mittal, Qingjun Cui, Vamsi Salaka

2023

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

Pre-trained language models (PLM) excel at capturing semantic similarity in language, while in e-commerce, customer shopping behavior data (e.g., clicks, add-to-cart, purchases) helps establish connections between similar queries based on behavior on products. This work addressed the challenges of using sparse behavior data to build a robust query-to-query similarity prediction model and apply it to a product search ranking system. Our contributions include a straightforward method for data generation, testing different model structures on both public PLMs and in-house PLMs fine-tuned with Amazon internal data. The fine-tuned in-house PLM model shows a 27.4% NDCG improvement compared with the BERT. And we designed an end-to-end pipeline that incorporates model outputs into prior feature. The prior scores can be used to impact ranking, matching, and recommendation systems. We tested the prior in an online experiment, which led to a significant improvement of 0.08% in the search click rate and a 0.03% reduction in the search reformulation rate. Overall, our approach has significant implications for improving search ranking and matching applications.

Behavior-driven query similarity prediction based on pre-trained language models for e-commerce search

Latest news

Work with us