Search - Amazon Science

Search-based Evaluation from Truth Transcripts for Voice Search Applications

Francois Mairesse, Paul Raccuglia, Shiv Vitaladevuni

SIGIR 2016

2016

Voice search applications are typically evaluated by comparing the predicted query to a reference human transcript, regardless of the search results returned by the query. While we find that an exact transcript match is highly indicative of user satisfaction, a transcript which does not match the reference still produces satisfactory search results a significant fraction of the time. This paper therefore

Conversational AI

Kalman Folding 5: Non-linear models and the EKF

Brian Beckman

ACM 2016

2016

We exhibit a foldable Extended Kalman Filter that internally integrates non-linear equations of motion with a nested fold of generic integrators over lazy streams in constant memory. Functional form allows us to switch integrators easily and to diagnose filter divergence accurately, achieving orders of magnitude better speed than the source example from the literature. As with all Kalman folds, we can move

Cloud and systems

Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models

Thomas Drugman, Janne Pylkkonen, Reinhard Kneser

Interspeech 2016

2016

The goal of this paper is to simulate the benefits of jointly applying active learning (AL) and semi-supervised training (SST) in a new speech recognition application. Our data selection approach relies on confidence filtering, and its impact on both the acoustic and language models (AM and LM) is studied. While AL is known to be beneficial to AM training, we show that it also carries out substantial improvements

Conversational AI

LATTICE RNN: Recurrent Neural Networks over Lattices

Faisal Ladhak, Ankur Gandhe, Markus Dreyer, Lambert Mathias, Ariya Rastrow, Björn Hoffmeister

Interspeech 2016

2016

We present a new model called LATTICERNN, which generalizes recurrent neural networks (RNNs) to process weighted lattices as input, instead of sequences. A LATTICERNN can encode the complete structure of a lattice into a dense representation, which makes it suitable to a variety of problems, including rescoring, classifying, parsing, or translating lattices using deep neural networks (DNNs). In this paper

Conversational AI

Anchored speech detection

Roland Maas, Sree Hari Krishnan Parthasarathi, Brian King, Ruitong Huang, Björn Hoffmeister

Interspeech 2016

2016

We propose two new methods of speech detection in the context of voice-controlled far-field appliances. While conventional detection methods are designed to differentiate between speech and nonspeech, we aim at distinguishing desired speech, which we define as speech originating from the person interacting with the device, from background noise and interfering talkers. Our two proposed methods use the first

Conversational AI

Adaptive, personalized diversity for visual discovery

Choon Hui Teo, Houssam Nassif, Daniel N. Hill, Sriram Srinivasan, Mitchell Goodman, Vijai Mohan, S. V. N. Vishwanathan

RecSys 2016

2016

Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shopping experience. Here we explore extensions in the direction of adaptive personalization and item diversification

Search and information retrieval

Amazon Search: The joy of ranking products

Daria Sorokina, Erick Cantú-Paz

SIGIR 2016

2016

Amazon is one of the world’s largest e-commerce sites and Amazon Search powers the majority of Amazon’s sales. As a consequence, even small improvements in relevance ranking both positively influence the shopping experience of millions of customers and significantly impact revenue. In the past, Amazon’s product search engine consisted of several handtuned ranking functions using a handful of input features

Search and information retrieval

Efficient exploration of text regions in natural scene images using adaptive image sampling

Ismet Zeki Yalniz, Douglas Gray, R. Manmatha

ECCV 2016

2016

An adaptive image sampling framework is proposed for identifying text regions in natural scene images. A small fraction of the pixels actually correspond to text regions. It is desirable to eliminate non-text regions at the early stages of text detection. First, the image is sampled row-by-row at a specific rate and each row is tested for containing text using an 1D adaptation of the Maximally Stable Extremal

Computer vision

Bounding the integrality distance of LP relaxations for structured prediction

Ben London, Ofer Meshi, Adrian Weller

NeurIPS 2016

2016

In structured prediction, a predictor optimizes an objective function over a combinatorial search space, such as the set of all image segmentations, or the set of all part-of-speech taggings. Unfortunately, finding the optimal structured labeling—sometimes referred to as maximum a posteriori (MAP) inference—is, in general, NP-hard [12], due to the combinatorial structure of the problem. Many inference approximations

Operations research and optimization

Bayesian intermittent demand forecasting for large inventories

Matthias Seeger, David Salinas, Valentin Flunkert

NeurIPS 2016

2016

We present a scalable and robust Bayesian method for demand forecasting in the context of a large e-commerce platform, paying special attention to intermittent and bursty target statistics. Inference is approximated by the Newton-Raphson algorithm, reduced to linear-time Kalman smoothing, which allows us to operate on several orders of magnitude larger problems than previous related work. In a study on

Operations research and optimization

Online dual decomposition for performance and delivery-based distributed ad allocation

Jim Huang, Rodolphe Jenatton, Cédric Archambeau

KDD 2016

2016

Online optimization is central to display advertising, where we must sequentially allocate ad impressions to maximize the total welfare among advertisers, while respecting various advertiser-specified long-term constraints (e.g., total amount of the ad’s budget that is consumed at the end of the campaign). In this paper, we present the online dual decomposition (ODD) framework for large-scale, online, distributed

Operations research and optimization

Riemannian stochastic variance reduced gradient on Grassmann manifold

Bamdev Mishra, Hiroyuki Kasai, Hiroyuki Sato

NeurIPS 2016

2016

Stochastic variance reduction algorithms have recently become popular for minimizing the average of a large, but finite, number of loss functions. In this paper, we propose a novel Riemannian extension of the Euclidean stochastic variance reduced gradient algorithm (R-SVRG) to a compact manifold search space.

Operations research and optimization

Velocity-based storage assignment in semi-automated storage systems

Rong Yuan, Tolga Cezik, Stephen C. Graves

MSOM 2016

2016

Our research focuses on the storage decision in a semi-automated storage system, where the inventory is stored on mobile storage pods. In a typical system, each storage pod carries a mixture of items, and the inventory of each item is spread over multiple storage pods.

Operations research and optimization

Compacting neural network classifiers via dropout training

Yotaro Kubo, George Tucker, Simon Wiesler

NeurIPS 2016

2016

We introduce dropout compaction, a novel method for training feed-forward neural networks which realizes the performance gains of training a large model with dropout regularization, yet extracts a compact neural network for run-time efficiency. In the proposed method, we introduce a sparsity-inducing prior on the per unit dropout retention probability so that the optimizer can effectively prune hidden units

Machine learning

Learning structured predictors from bandit feedback for interactive NLP

Artem Sokolov, Julia Kreutzer, Chirstopher Lo, Stefan Riezler

ACL 2016

2016

Structured prediction from bandit feedback describes a learning scenario where instead of having access to a gold standard structure, a learner only receives partial feedback in form of the loss value of a predicted structure. We present new learning objectives and algorithms for this interactive scenario, focusing on convergence speed and ease of elicitability of feedback. We present supervised-to-bandit

Conversational AI

Generative adversarial structured networks

Ben London, Alex Schwing

NeurIPS 2016

2016

We propose a technique that combines generative adversarial networks with probabilistic graphical models to explicitly model dependencies in structured distributions. Generative adversarial structured networks (GASNs) produce samples by passing random inputs through a neural network to construct the potentials of a graphical model; maximum a-posteriori inference in this graphical model then yields a sample

Machine learning

Handling confounding for realistic off-policy evaluation

Saurabh Sohoney, Nikita Prabhu, Vineet Chaoji

WWW 2018

2016

Inverse Propensity Score estimator (IPS) is a basic, unbiased, offpolicy evaluation technique to measure the impact of a user-interactive system without serving live traffic. We present our work on applying IPS to real-world settings by addressing some practical challenges, thereby enabling successful policy evaluation. In particular, we show that off-policy evaluation can be impossible in the absence of

Machine learning

Machine learning (ML) in the real world

Gourav Roy, Rajeev Rastogi, Vineet Chaoji

VLDB 2016

2016

Machine Learning (ML) has become a mature technology that is being applied to a wide range of business problems such as web search, online advertising, product recommendations, object recognition, and so on. As a result, it has become imperative for researchers and practitioners to have a fundamental understanding of ML concepts and practical knowledge of end-to-end modeling. This tutorial takes a hands-on

Machine learning

Robust random cut forest based anomaly detection on streams

Sudipto Guha, Nina Mishra, Gourav Roy, Okke Schrijvers

ICML 2016

2016

In this paper we focus on the anomaly detection problem for dynamic data streams through the lens of random cut forests. We investigate a robust random cut data structure that can be used as a sketch or synopsis of the input stream. We provide a plausible definition of non-parametric anomalies based on the influence of an unseen point on the remainder of the data, i.e., the externality imposed by that point

Machine learning

Sparse and low-rank decomposition for big data systems via smoothed Riemannian optimization

Yuanming Shi, Bamdev Mishra

NeurIPS 2016

2016

We provide a unified modeling framework of sparse and low-rank decomposition to investigate the fundamental limits of communication, computation, and storage in mobile big data systems. The resulting sparse and low-rank optimization problems are highly intractable non-convex optimization problems and conventional convex relaxation approaches are inapplicable, for which we propose a smoothed Riemannian optimization

Information and knowledge management

Search results

Work with us