Search and information retrieval

Developing advanced techniques to analyze behavioral patterns, lexical matches, and semantic matches to surface the most relevant recommendations in response to your queries.

Exploring ℓ0 sparsification for inference-free sparse retrievers

Zhichao Geng, Xinjie Shen, Charlie Yang

SIGIR 2025

2025

With increasing demands for efficiency, information retrieval has developed a branch of sparse retrieval, further advancing towards inference-free retrieval, where documents are encoded at indexing time and no model inference is required for queries. Existing sparse retrieval models rely on FLOPS regularization for sparsification. While this mechanism was originally designed for Siamese encoders, it

Clustering context in off-policy evaluation

Daniel Guzman Olivares, Philipp Schmidt, Jacek Golebiowski, Artur Bekasov

AISTATS 2025

2025

Off-policy evaluation can leverage logged data to estimate the effectiveness of new policies in ecommerce, search engines, media streaming services, or automatic diagnostic tools in healthcare. However, the performance of baseline off-policy estimators like IPS deteriorates when the logging policy significantly differs from the evaluation policy. Recent work proposes sharing information across similar actions

Learning to rewrite negation queries in product search

Mengtian Guo, Mutasem Al-Darabsah, Choon Hui Teo, Jonathan May, Tarun Agarwal, Rahul Bhagat

COLING 2025

2025

In product search, negation is frequently used to articulate unwanted product features or components. Modern search engines often struggle to comprehend negations, resulting in suboptimal user experiences. While various methods have been proposed to tackle negations in search, none of them took the vocabulary gap between query keywords and product text into consideration. In this work, we introduced a query

RTSM: Knowledge distillation with diverse signals for efficient real-time semantic matching in e-commerce

Sanjay Agrawal, Vivek Sembium

NAACL 2025

2025

Semantic matching plays a pivotal role in e-commerce by facilitating better product discovery and driving sales within online stores. Transformer models have proven exceptionally effective in mapping queries to an embedding space, positioning semantically related entities (queries or products) in close proximity. Despite their effectiveness, the high computational demands of large transformer models pose

Personalised outfit recommendation via history-aware transformers

Myong Chol Jung, Julien Monteil, Philip Schulz, Volodymyr Vaskovych

WSDM 2025

2025

We present the history-aware transformer (HAT), a transformer-based model that uses shoppers’ purchase history to personalise outfit predictions. The aim of this work is to recommend outfits that are internally coherent while matching an individual shopper’s style and taste. To achieve this, we stack two transformer models, one that produces outfit representations and another one that processes the history

Using generative AI to do multimodal information retrieval

Sungyeon Kim, Xiaofan Lin

June 25, 2025

With large datasets, directly generating data ID codes from query embeddings is much more efficient than performing pairwise comparisons between queries and candidate responses.

The technology behind Amazon’s GenAI-powered shopping assistant, Rufus

Trishul Chilimbi

October 04, 2024

Rufus leverages AWS chips Trainium and Inferentia, AWS’s elasticity and scalability, and a custom-built large language model to quickly answer shoppers’ questions.

Interpretable ensemble models improve product retrieval

Nurendra Choudhary

July 03, 2024

Gradient-boosted decision trees aggregate model outputs, and Shapley values help identify the most useful models for the ensemble.

Building commonsense knowledge graphs to aid product recommendation

Changlong Yu, Zheng Li

May 10, 2024

Using large language models to discern commonsense relationships can improve performance on downstream tasks by as much as 60%.

Teaching household robots where to find requested objects

Gunnar Sigurdsson

October 06, 2023

Leveraging a large vision-language foundation model enables state-of-the-art performance in remote-object grounding.

Ensuring that customers don't miss out on trending products

Hao Ding, Yifei Ma

September 26, 2023

Time series forecasting enables up-to-the-minute trend recognition, while a novel two-step training process improves forecast accuracy.
