Search - Amazon Science

Bounding the integrality distance of LP relaxations for structured prediction

Ben London, Ofer Meshi, Adrian Weller

NeurIPS 2016

2016

In structured prediction, a predictor optimizes an objective function over a combinatorial search space, such as the set of all image segmentations, or the set of all part-of-speech taggings. Unfortunately, finding the optimal structured labeling—sometimes referred to as maximum a posteriori (MAP) inference—is, in general, NP-hard [12], due to the combinatorial structure of the problem. Many inference approximations

Operations research and optimization

Near-optimal disjoint-path facility location through set cover by pairs

David S. Johnson, Lee Breslau, Ilias Diakonikolas, Nick Duffield, Yu Gu, Mohammad Taghi Hajiaghayi, Howard Karloff, Mauricio G. C. Resende, Subhabrata Sen

arXiv

2016

In this paper we consider two special cases of the “cover-by-pairs” optimization problem that arise when we need to place facilities so that each customer is served by two facilities that reach it by disjoint shortest paths. These problems arise in a network traffic monitoring scheme proposed by Breslau et al. and have potential applications to content distribution. The “set-disjoint” variant applies to

Operations research and optimization

Adaptive, personalized diversity for visual discovery

Choon Hui Teo, Houssam Nassif, Daniel N. Hill, Sriram Srinivasan, Mitchell Goodman, Vijai Mohan, S. V. N. Vishwanathan

RecSys 2016

2016

Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shopping experience. Here we explore extensions in the direction of adaptive personalization and item diversification

Search and information retrieval

Amazon Search: The joy of ranking products

Daria Sorokina, Erick Cantú-Paz

SIGIR 2016

2016

Amazon is one of the world’s largest e-commerce sites and Amazon Search powers the majority of Amazon’s sales. As a consequence, even small improvements in relevance ranking both positively influence the shopping experience of millions of customers and significantly impact revenue. In the past, Amazon’s product search engine consisted of several handtuned ranking functions using a handful of input features

Search and information retrieval

Optimization by GRASP: Greedy randomized adaptive search procedures

Mauricio G. C. Resende, Celso C. Ribeiro

Springer Nature

2016

This is the first book to cover GRASP (Greedy Randomized Adaptive Search Procedures), a metaheuristic that has enjoyed wide success in practice with a broad range of applications to real-world combinatorial optimization problems. The state-of-the-art coverage and carefully crafted pedagogical style lends this book highly accessible as an introductory text not only to GRASP, but also to combinatorial optimization

Operations research and optimization

Model compression applied to small-footprint keyword spotting

George Tucker, Minhua Wu, Ming Sun, Sankaran Panchapagesan, Gengshen Fu, Shiv Vitaladevuni

Interspeech 2016

2016

Several consumer speech devices feature voice interfaces that perform on-device keyword spotting to initiate user interactions. Accurate on-device keyword spotting within a tight CPU budget is crucial for such devices. Motivated by this, we investigated two ways to improve deep neural network (DNN) acoustic models for keyword spotting without increasing CPU usage. First, we used low-rank weight matrices

Conversational AI

Optimizing Speech Recognition Evaluation Using Stratified Sampling

Janne Pylkkonen, Thomas Drugman, Max Bisani

Interspeech 2016

2016

Producing large enough quantities of high-quality transcriptions for accurate and reliable evaluation of an automatic speech recognition (ASR) system can be costly. It is therefore desirable to minimize the manual transcription work for producing metrics with an agreed precision. In this paper we demonstrate how to improve ASR evaluation precision using stratified sampling. We show that by altering the

Conversational AI

Sparse and low-rank decomposition for big data systems via smoothed Riemannian optimization

Yuanming Shi, Bamdev Mishra

NeurIPS 2016

2016

We provide a unified modeling framework of sparse and low-rank decomposition to investigate the fundamental limits of communication, computation, and storage in mobile big data systems. The resulting sparse and low-rank optimization problems are highly intractable non-convex optimization problems and conventional convex relaxation approaches are inapplicable, for which we propose a smoothed Riemannian optimization

Information and knowledge management

Diversifying music recommendations

Houssam Nassif, Kemal Oral Cansizlar, Mitchell Goodman, S. V. N. Vishwanathan

ICML 2016

2016

We compare submodular and Jaccard methods to diversify Amazon Music recommendations. Submodularity significantly improves recommendation quality and user engagement. Unlike the Jaccard method, our submodular approach incorporates item relevance score within its optimization function, and produces a relevant and uniformly diverse set.

Search and information retrieval

A Riemannian gossip approach to decentralized matrix completion

Bamdev Mishra, Hiroyuki Kasai, Hiroyuki Sato

NeurIPS 2016

2016

In this paper, we propose novel gossip algorithms for the low-rank decentralized matrix completion problem. The proposed approach is on the Riemannian Grassmann manifold that allows local matrix completion by different agents while achieving asymptotic consensus on the global low-rank factors. The resulting approach is scalable and parallelizable. Our numerical experiments show the good performance of the

Operations research and optimization

Adaptive algorithms for online convex optimization with long-term constraints

Rodolphe Jenatton, Jim Huang, Cédric Archambeau

ICML 2016

2016

We present an adaptive online gradient descent algorithm to solve online convex optimization problems with long-term constraints, which are constraints that need to be satisfied when accumulated over a finite number of rounds T, but can be violated in intermediate rounds. For some user-defined trade-off parameter β ∈ (0, 1), the proposed algorithm achieves cumulative regret bounds of O(T max{β,1−β} ) and

Operations research and optimization

Compacting neural network classifiers via dropout training

Yotaro Kubo, George Tucker, Simon Wiesler

NeurIPS 2016

2016

We introduce dropout compaction, a novel method for training feed-forward neural networks which realizes the performance gains of training a large model with dropout regularization, yet extracts a compact neural network for run-time efficiency. In the proposed method, we introduce a sparsity-inducing prior on the per unit dropout retention probability so that the optimizer can effectively prune hidden units

Machine learning

Machine learning (ML) in the real world

Gourav Roy, Rajeev Rastogi, Vineet Chaoji

VLDB 2016

2016

Machine Learning (ML) has become a mature technology that is being applied to a wide range of business problems such as web search, online advertising, product recommendations, object recognition, and so on. As a result, it has become imperative for researchers and practitioners to have a fundamental understanding of ML concepts and practical knowledge of end-to-end modeling. This tutorial takes a hands-on

Machine learning

Efficient exploration of text regions in natural scene images using adaptive image sampling

Ismet Zeki Yalniz, Douglas Gray, R. Manmatha

ECCV 2016

2016

An adaptive image sampling framework is proposed for identifying text regions in natural scene images. A small fraction of the pixels actually correspond to text regions. It is desirable to eliminate non-text regions at the early stages of text detection. First, the image is sampled row-by-row at a specific rate and each row is tested for containing text using an 1D adaptation of the Maximally Stable Extremal

Computer vision

Anchored speech detection

Roland Maas, Sree Hari Krishnan Parthasarathi, Brian King, Ruitong Huang, Björn Hoffmeister

Interspeech 2016

2016

We propose two new methods of speech detection in the context of voice-controlled far-field appliances. While conventional detection methods are designed to differentiate between speech and nonspeech, we aim at distinguishing desired speech, which we define as speech originating from the person interacting with the device, from background noise and interfering talkers. Our two proposed methods use the first

Conversational AI

Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models

Thomas Drugman, Janne Pylkkonen, Reinhard Kneser

Interspeech 2016

2016

The goal of this paper is to simulate the benefits of jointly applying active learning (AL) and semi-supervised training (SST) in a new speech recognition application. Our data selection approach relies on confidence filtering, and its impact on both the acoustic and language models (AM and LM) is studied. While AL is known to be beneficial to AM training, we show that it also carries out substantial improvements

Conversational AI

LATTICE RNN: Recurrent Neural Networks over Lattices

Faisal Ladhak, Ankur Gandhe, Markus Dreyer, Lambert Mathias, Ariya Rastrow, Björn Hoffmeister

Interspeech 2016

2016

We present a new model called LATTICERNN, which generalizes recurrent neural networks (RNNs) to process weighted lattices as input, instead of sequences. A LATTICERNN can encode the complete structure of a lattice into a dense representation, which makes it suitable to a variety of problems, including rescoring, classifying, parsing, or translating lattices using deep neural networks (DNNs). In this paper

Conversational AI

Kalman Folding 5: Non-linear models and the EKF

Brian Beckman

ACM 2016

2016

We exhibit a foldable Extended Kalman Filter that internally integrates non-linear equations of motion with a nested fold of generic integrators over lazy streams in constant memory. Functional form allows us to switch integrators easily and to diagnose filter divergence accurately, achieving orders of magnitude better speed than the source example from the literature. As with all Kalman folds, we can move

Cloud and systems

Riemannian stochastic variance reduced gradient on Grassmann manifold

Bamdev Mishra, Hiroyuki Kasai, Hiroyuki Sato

NeurIPS 2016

2016

Stochastic variance reduction algorithms have recently become popular for minimizing the average of a large, but finite, number of loss functions. In this paper, we propose a novel Riemannian extension of the Euclidean stochastic variance reduced gradient algorithm (R-SVRG) to a compact manifold search space.

Operations research and optimization

Velocity-based storage assignment in semi-automated storage systems

Rong Yuan, Tolga Cezik, Stephen C. Graves

MSOM 2016

2016

Our research focuses on the storage decision in a semi-automated storage system, where the inventory is stored on mobile storage pods. In a typical system, each storage pod carries a mixture of items, and the inventory of each item is spread over multiple storage pods.

Operations research and optimization

Search results

Work with us