Machine learning

Developing algorithms and statistical models that computer systems use to perform tasks without explicit instructions, relying on patterns and inference instead.

AmazonQAC: A large-scale, naturalistic query autocomplete dataset

Dante Everaert, Rohit Patki, Tianqi Zheng, Christopher Potts

EMNLP 2024

2024

Query Autocomplete (QAC) is a critical feature in modern search engines, facilitating user interaction by predicting search queries based on input prefixes. Despite its widespread adoption, the absence of large-scale, realistic datasets has hindered advancements in QAC system development. This paper addresses this gap by introducing AmazonQAC, a new QAC dataset sourced from Amazon Search logs, comprising

Machine learning
Auto-evolve: Enhancing large language model’s performance via self-reasoning framework

Krishna Aswani, Alex Lu, Pranav Patankar, Priya Dhalwani, Iris Tan, Jayant Ganeshmohan, Simon Lacasse

Findings of EMNLP 2024

2024

Recent advancements in prompt engineering strategies, such as Chain-of-Thought (CoT) and Self-Discover, have demonstrated significant potential in improving the reasoning abilities of Large Language Models (LLMs). However, these state-of-the-art (SOTA) prompting strategies rely on single or fixed set of static seed reasoning modules like "think step by step" or "break down this problem" intended to simulate

Conversational AI
Sequential LLM framework for fashion recommendation

Han Liu, Xianfeng Tang, Tianlang Chen, Jiapeng Liu, Indu Indu, Henry Peng Zou, Peng Dai, Roberto Fernandez Galan, Mike Porter, Dongmei Jia, Ning Zhang, Lian Xiong

EMNLP 2024

2024

The fashion industry is one of the leading domains in the global e-commerce sector, prompting major online retailers to employ recommendation systems for product suggestions and customer convenience. While recommendation systems have been widely studied, most are designed for general e-commerce problems and struggle with the unique challenges of the fashion domain. To address these issues, we propose a

Machine learning
Revisiting SMoE language models by evaluating inefficiencies with task specific expert pruning

Soumajyoti Sarkar, Leonard Lausen, Volkan Cevher, Sheng Zha, Thomas Brox, George Karypis

NeurIPS 2024 Workshop on Efficient Natural Language and Speech Processing (ENLSP-IV)

2024

Sparse Mixture of Expert (SMoE) models have emerged as a scalable alternative to dense models in language modeling. These models use conditionally activated feedforward subnetworks in transformer blocks, allowing for a separation between total model parameters and per-example computation. However, large token-routed SMoE models face a significant challenge: during inference, the entire model must be used

Machine learning
E-commerce product categorization with LLM-based dual-expert classification paradigm

Zhu Cheng, Wen Zhang, Chih-Chi (Jimmy) Chou, You-Yi Jau, Archita Pathak, Penny Gao, Umit Batur

EMNLP 2024 Workshop on Customizable NLP

2024

Accurate product categorization in e-commerce is critical for delivering a satisfactory online shopping experience to customers. With the vast number of available products and the numerous potential categories, it becomes crucial to develop a classification system capable of assigning products to their correct categories with high accuracy. We present a dual-expert classification system that utilizes the

Conversational AI

re:MARS 2019: Jenny Freshwater keynote presentation

November 26, 2019

Amazon's director of forecasting, Jenny Freshwater, speaks about how AI is used to power forecasting decisions, so that items are always in stock for Amazon's customers.

Machine learning
3 important themes from Amazon's 2019 NeurIPS papers

Larry Hardesty

November 25, 2019

Time series forecasting, bandit problems, and optimization are integral to Amazon's efforts to deliver better value for its customers.

Machine learning
Credit: Flavia Loreto

Artificial Intelligence—The revolution hasn’t happened yet

Michael Jordan

November 22, 2019

Michael I. Jordan, Amazon Scholar and professor at the University of California, Berkeley, writes about the classical goals in human-imitative AI, and reflects on how in the current hubbub over the AI revolution it is easy to forget that these goals haven’t yet been achieved.

Machine learning
The history of Amazon's recommendation algorithm

Larry Hardesty

November 22, 2019

In 2017, when the journal IEEE Internet Computing was celebrating its 20th anniversary, its editorial board decided to identify the single paper from its publication history that had best withstood the “test of time”. The honor went to a 2003 paper called “Amazon.com Recommendations: Item-to-Item Collaborative Filtering”, by then Amazon researchers Greg Linden, Brent Smith, and Jeremy York.

Search and information retrieval
_{Animation by Lenni Armstrong, inform-motion}

How to construct the optimal neural architecture for your machine learning task

Adrian de Wynter

September 23, 2019

The first step in training a neural network to solve a problem is usually the selection of an architecture: a specification of the number of computational nodes in the network and the connections between them. Architectural decisions are generally based on historical precedent, intuition, and plenty of trial and error.

Machine learning
Animation by Nick Little

Accelerating parallel training of neural nets

Pranav Ladkat

September 5, 2019

Earlier this year, we reported a speech recognition system trained on a million hours of data, a feat possible through semi-supervised learning, in which training data is annotated by machines rather than by people. These sorts of massive machine learning projects are becoming more common, and they require distributing the training process across multiple processors. Otherwise, training becomes too time consuming.

Machine learning

Machine learning

Recent publications

Related content

Work with us