Search - Amazon Science

Amazon Search: The joy of ranking products

SIGIR 2016

2016

Amazon is one of the world’s largest e-commerce sites and Amazon Search powers the majority of Amazon’s sales. As a consequence, even small improvements in relevance ranking both positively influence the shopping experience of millions of customers and significantly impact revenue. In the past, Amazon’s product search engine consisted of several handtuned ranking functions using a handful of input features

Search and information retrieval

Multi-task learning and Weighted Cross-entropy for DNN-based Keyword Spotting

Sankaran Panchapagesan, Ming Sun, Aparna Khare, Spyros Matsoukas, Arindam Mandal, Björn Hoffmeister, Shiv Vitaladevuni

Interspeech 2016

2016

We propose improved Deep Neural Network (DNN) training loss functions for more accurate single keyword spotting on resource-constrained embedded devices. The loss function modifications consist of a combination of multi-task training and weighted cross entropy. In the multi-task architecture, the keyword DNN acoustic model is trained with two tasks in parallel - the main task of predicting the keyword-specific

Conversational AI

Bayesian intermittent demand forecasting for large inventories

Matthias Seeger, David Salinas, Valentin Flunkert

NeurIPS 2016

2016

We present a scalable and robust Bayesian method for demand forecasting in the context of a large e-commerce platform, paying special attention to intermittent and bursty target statistics. Inference is approximated by the Newton-Raphson algorithm, reduced to linear-time Kalman smoothing, which allows us to operate on several orders of magnitude larger problems than previous related work. In a study on

Operations research and optimization

Online dual decomposition for performance and delivery-based distributed ad allocation

Jim Huang, Rodolphe Jenatton, Cédric Archambeau

KDD 2016

2016

Online optimization is central to display advertising, where we must sequentially allocate ad impressions to maximize the total welfare among advertisers, while respecting various advertiser-specified long-term constraints (e.g., total amount of the ad’s budget that is consumed at the end of the campaign). In this paper, we present the online dual decomposition (ODD) framework for large-scale, online, distributed

Operations research and optimization

Riemannian stochastic variance reduced gradient on Grassmann manifold

Bamdev Mishra, Hiroyuki Kasai, Hiroyuki Sato

NeurIPS 2016

2016

Stochastic variance reduction algorithms have recently become popular for minimizing the average of a large, but finite, number of loss functions. In this paper, we propose a novel Riemannian extension of the Euclidean stochastic variance reduced gradient algorithm (R-SVRG) to a compact manifold search space.

Operations research and optimization

Velocity-based storage assignment in semi-automated storage systems

Rong Yuan, Tolga Cezik, Stephen C. Graves

MSOM 2016

2016

Our research focuses on the storage decision in a semi-automated storage system, where the inventory is stored on mobile storage pods. In a typical system, each storage pod carries a mixture of items, and the inventory of each item is spread over multiple storage pods.

Operations research and optimization

Bounding the integrality distance of LP relaxations for structured prediction

Ben London, Ofer Meshi, Adrian Weller

NeurIPS 2016

2016

In structured prediction, a predictor optimizes an objective function over a combinatorial search space, such as the set of all image segmentations, or the set of all part-of-speech taggings. Unfortunately, finding the optimal structured labeling—sometimes referred to as maximum a posteriori (MAP) inference—is, in general, NP-hard [12], due to the combinatorial structure of the problem. Many inference approximations

Operations research and optimization

Near-optimal disjoint-path facility location through set cover by pairs

David S. Johnson, Lee Breslau, Ilias Diakonikolas, Nick Duffield, Yu Gu, Mohammad Taghi Hajiaghayi, Howard Karloff, Mauricio G. C. Resende, Subhabrata Sen

arXiv

2016

In this paper we consider two special cases of the “cover-by-pairs” optimization problem that arise when we need to place facilities so that each customer is served by two facilities that reach it by disjoint shortest paths. These problems arise in a network traffic monitoring scheme proposed by Breslau et al. and have potential applications to content distribution. The “set-disjoint” variant applies to

Operations research and optimization

Scalability comparison scripts for deep learning frameworks

Vishaal Kapoor, Indu Thangakrishnan, Piyush Ghai, Frank Liu, Vandana Kannan, Jake Lee, Qing Lan, Suraj Kota, Anirudh Subramanian, Manu Seth, Andrew Ayres, Roshani Nagmote, Chaitanya Prakash Bapat, Anton Chernov, Dhanasekar Karuppasamy, Hao Jin, Rohit Srivastava, Sandeep Krishnamurthy, Amol Lele, Henri Yandell

2016

This repository contains scripts that compares the scalability of deep learning frameworks. The scripts train Inception v3 and AlexNet using synchronous stochastic gradient descent (SGD). To run the comparison in reasonable time, we run few tens of iterations of SGD and compute the throughput as images processed per second. Comparisons can be done on clusters created with AWS CloudFormation using the Amazon

Machine learning

Amazon Redshift and the case for simpler data warehouses

Anurag Gupta, Deepak Agarwal, Derek Tan, Jakub Kulesza, Rahul Pathak, Stefano Stefani, Vidhya Srinivasan

ACM SIGMOD 2015

2015

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse solution that makes it simple and cost-effective to efficiently analyze large volumes of data using existing business intelligence tools. Since launching in February 2013, it has been Amazon Web Service’s (AWS) fastest growing service, with many thousands of customers and many petabytes of data under management. Amazon Redshift’s pace

Cloud and systems

Robust i-Vector Based Adaptation of DNN Acoustic Model for Speech Recognition

Sri Garimella, Arindam Mandal, Nikko Ström, Björn Hoffmeister, Spyros Matsoukas, Sree Hari Krishnan Parthasarathi

Interspeech 2015

2015

In the past, conventional i-vectors based on a Universal Background Model (UBM) have been successfully used as input features to adapt a Deep Neural Network (DNN) Acoustic Model (AM) for Automatic Speech Recognition (ASR). In contrast, this paper introduces Hidden Markov Model (HMM) based ivectors that use HMM state alignment information from an ASR system for estimating i-vectors. Further, we propose passing

Conversational AI

Accurate Endpointing with Expected Pause Duration

Baiyang Liu, Björn Hoffmeister, Ariya Rastrow

Interspeech 2015

2015

In an online automatic speech recognition system, the role of the endpoint detector is to infer when a user has finished speaking a query. Accurate and low-latency endpoint detection is crucial for natural voice interaction. Classic voice activity detector (VAD) based approaches monitor the incoming audio and trigger when a sufficiently long pause is detected. Such approaches are typically limited due to

Conversational AI

Scalable Distributed DNN Training Using Commodity GPU Cloud Computing

Nikko Ström

Interspeech 2015

2015

We introduce a new method for scaling up distributed Stochastic Gradient Descent (SGD) training of Deep Neural Networks (DNN). The method solves the well-known communication bottleneck problem that arises for data-parallel SGD because compute nodes frequently need to synchronize a replica of the model. We solve it by purposefully controlling the rate of weight-update per individual weight, which is in contrast

Cloud and systems

On challenges in machine learning model management

Sebastian Schelter, Felix Biessmann, Tim Januschowski, David Salinas, Stephan Seufert, Gyuri Szarvas

IEEE Data Engineering Bulletin

2015

The training, maintenance, deployment, monitoring, organization and documentation of machine learning (ML) models — in short, model management — is a critical task in virtually all production ML use cases. Wrong model management decisions can lead to poor performance of a ML system and result in high maintenance cost. As research on both infrastructure and algorithms is quickly evolving, there is a lack

Cloud and systems

fMLLR Based Feature-Space Speaker Adaptation of DNN Acoustic Models

Sree Hari Krishnan Parthasarathi, Björn Hoffmeister, Spyros Matsoukas, Arindam Mandal, Nikko Ström, Sri Garimella

Interspeech 2015

2015

We investigate the problem of speaker adaptation of DNN acoustic models in two settings: the traditional unsupervised adaptation and a supervised adaptation (SuA) where a few minutes of transcribed speech is available. SuA presents additional difficulties when a test speaker’s adaptation information does not match the registered speaker’s information. Employing feature-space maximum likelihood linear regression

Conversational AI

How Amazon Web Services uses formal methods

Chris Newcombe, Tim Rath, Fan Zhang, Bogdan Munteanu, Marc Brooker, Michael Deardeuff

Communications of the ACM

2015

Since 2011, ENGINEERS at Amazon Web Services (AWS) have used formal specification and model checking to help solve difficult design problems in critical systems. Here, we describe our motivation and experience, what has worked well in our problem domain, and what has not. When discussing personal experience we refer to the authors by their initials. At AWS we strive to build services that are simple for

Cloud and systems

Dynamo: Amazon’s highly available key-value store

Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, Werner Vogels

ACM Symposium on Operating System Principles

2007

Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components

Cloud and systems

Search results

Work with us