
Kalman Folding 5: Non-linear models and the EKF

ACM 2016

2016

We exhibit a foldable Extended Kalman Filter that internally integrates non-linear equations of motion with a nested fold of generic integrators over lazy streams in constant memory. Functional form allows us to switch integrators easily and to diagnose filter divergence accurately, achieving orders of magnitude better speed than the source example from the literature. As with all Kalman folds, we can move

Cloud and systems

LATTICE RNN: Recurrent Neural Networks over Lattices

Faisal Ladhak, Ankur Gandhe, Markus Dreyer, Lambert Mathias, Ariya Rastrow, Björn Hoffmeister

Interspeech 2016

2016

We present a new model called LATTICERNN, which generalizes recurrent neural networks (RNNs) to process weighted lattices as input, instead of sequences. A LATTICERNN can encode the complete structure of a lattice into a dense representation, which makes it suitable for a variety of problems, including rescoring, classifying, parsing, or translating lattices using deep neural networks (DNNs). In this paper

Conversational AI

Optimization by GRASP: Greedy randomized adaptive search procedures

Mauricio G. C. Resende, Celso C. Ribeiro

Springer Nature

2016

This is the first book to cover GRASP (Greedy Randomized Adaptive Search Procedures), a metaheuristic that has enjoyed wide success in practice with a broad range of applications to real-world combinatorial optimization problems. The state-of-the-art coverage and carefully crafted pedagogical style make this book highly accessible as an introductory text not only to GRASP, but also to combinatorial optimization

Operations research and optimization

Max-Pooling Loss Trained Long Short Term Memory Network For Small-Footprint Keyword Spotting

Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Ström, Shiv Vitaladevuni

SLT 2016

2016

We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements. The max-pooling loss training can be further guided by initializing with a cross-entropy loss trained network. A posterior smoothing based evaluation approach is employed to measure keyword spotting performance. Our experimental

Conversational AI

Model compression applied to small-footprint keyword spotting

George Tucker, Minhua Wu, Ming Sun, Sankaran Panchapagesan, Gengshen Fu, Shiv Vitaladevuni

Interspeech 2016

2016

Several consumer speech devices feature voice interfaces that perform on-device keyword spotting to initiate user interactions. Accurate on-device keyword spotting within a tight CPU budget is crucial for such devices. Motivated by this, we investigated two ways to improve deep neural network (DNN) acoustic models for keyword spotting without increasing CPU usage. First, we used low-rank weight matrices

Conversational AI

Optimizing Speech Recognition Evaluation Using Stratified Sampling

Janne Pylkkonen, Thomas Drugman, Max Bisani

Interspeech 2016

2016

Producing large enough quantities of high-quality transcriptions for accurate and reliable evaluation of an automatic speech recognition (ASR) system can be costly. It is therefore desirable to minimize the manual transcription work for producing metrics with an agreed precision. In this paper we demonstrate how to improve ASR evaluation precision using stratified sampling. We show that by altering the

Conversational AI

Sparse and low-rank decomposition for big data systems via smoothed Riemannian optimization

Yuanming Shi, Bamdev Mishra

NeurIPS 2016

2016

We provide a unified modeling framework of sparse and low-rank decomposition to investigate the fundamental limits of communication, computation, and storage in mobile big data systems. The resulting sparse and low-rank optimization problems are highly intractable non-convex optimization problems and conventional convex relaxation approaches are inapplicable, for which we propose a smoothed Riemannian optimization

Information and knowledge management

Adaptive algorithms for online convex optimization with long-term constraints

Rodolphe Jenatton, Jim Huang, Cédric Archambeau

ICML 2016

2016

We present an adaptive online gradient descent algorithm to solve online convex optimization problems with long-term constraints, which are constraints that need to be satisfied when accumulated over a finite number of rounds T, but can be violated in intermediate rounds. For some user-defined trade-off parameter β ∈ (0, 1), the proposed algorithm achieves cumulative regret bounds of O(T^{max{β,1−β}}) and

Operations research and optimization

Robust random cut forest based anomaly detection on streams

Sudipto Guha, Nina Mishra, Gourav Roy, Okke Schrijvers

ICML 2016

2016

In this paper we focus on the anomaly detection problem for dynamic data streams through the lens of random cut forests. We investigate a robust random cut data structure that can be used as a sketch or synopsis of the input stream. We provide a plausible definition of non-parametric anomalies based on the influence of an unseen point on the remainder of the data, i.e., the externality imposed by that point

Machine learning

Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models

Thomas Drugman, Janne Pylkkonen, Reinhard Kneser

Interspeech 2016

2016

The goal of this paper is to simulate the benefits of jointly applying active learning (AL) and semi-supervised training (SST) in a new speech recognition application. Our data selection approach relies on confidence filtering, and its impact on both the acoustic and language models (AM and LM) is studied. While AL is known to be beneficial to AM training, we show that it also brings substantial improvements

Conversational AI

Adaptive, personalized diversity for visual discovery

Choon Hui Teo, Houssam Nassif, Daniel N. Hill, Sriram Srinivasan, Mitchell Goodman, Vijai Mohan, S. V. N. Vishwanathan

RecSys 2016

2016

Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shopping experience. Here we explore extensions in the direction of adaptive personalization and item diversification

Search and information retrieval

Amazon Search: The joy of ranking products

Daria Sorokina, Erick Cantú-Paz

SIGIR 2016

2016

Amazon is one of the world’s largest e-commerce sites and Amazon Search powers the majority of Amazon’s sales. As a consequence, even small improvements in relevance ranking both positively influence the shopping experience of millions of customers and significantly impact revenue. In the past, Amazon’s product search engine consisted of several hand-tuned ranking functions using a handful of input features

Search and information retrieval

Multi-task learning and Weighted Cross-entropy for DNN-based Keyword Spotting

Sankaran Panchapagesan, Ming Sun, Aparna Khare, Spyros Matsoukas, Arindam Mandal, Björn Hoffmeister, Shiv Vitaladevuni

Interspeech 2016

2016

We propose improved Deep Neural Network (DNN) training loss functions for more accurate single keyword spotting on resource-constrained embedded devices. The loss function modifications consist of a combination of multi-task training and weighted cross entropy. In the multi-task architecture, the keyword DNN acoustic model is trained with two tasks in parallel - the main task of predicting the keyword-specific

Conversational AI

Bayesian intermittent demand forecasting for large inventories

Matthias Seeger, David Salinas, Valentin Flunkert

NeurIPS 2016

2016

We present a scalable and robust Bayesian method for demand forecasting in the context of a large e-commerce platform, paying special attention to intermittent and bursty target statistics. Inference is approximated by the Newton-Raphson algorithm, reduced to linear-time Kalman smoothing, which allows us to operate on several orders of magnitude larger problems than previous related work. In a study on

Operations research and optimization

Online dual decomposition for performance and delivery-based distributed ad allocation

Jim Huang, Rodolphe Jenatton, Cédric Archambeau

KDD 2016

2016

Online optimization is central to display advertising, where we must sequentially allocate ad impressions to maximize the total welfare among advertisers, while respecting various advertiser-specified long-term constraints (e.g., total amount of the ad’s budget that is consumed at the end of the campaign). In this paper, we present the online dual decomposition (ODD) framework for large-scale, online, distributed

Operations research and optimization

Riemannian stochastic variance reduced gradient on Grassmann manifold

Bamdev Mishra, Hiroyuki Kasai, Hiroyuki Sato

NeurIPS 2016

2016

Stochastic variance reduction algorithms have recently become popular for minimizing the average of a large, but finite, number of loss functions. In this paper, we propose a novel Riemannian extension of the Euclidean stochastic variance reduced gradient algorithm (R-SVRG) to a compact manifold search space.

Operations research and optimization

Velocity-based storage assignment in semi-automated storage systems

Rong Yuan, Tolga Cezik, Stephen C. Graves

MSOM 2016

2016

Our research focuses on the storage decision in a semi-automated storage system, where the inventory is stored on mobile storage pods. In a typical system, each storage pod carries a mixture of items, and the inventory of each item is spread over multiple storage pods.

Operations research and optimization

Bounding the integrality distance of LP relaxations for structured prediction

Ben London, Ofer Meshi, Adrian Weller

NeurIPS 2016

2016

In structured prediction, a predictor optimizes an objective function over a combinatorial search space, such as the set of all image segmentations, or the set of all part-of-speech taggings. Unfortunately, finding the optimal structured labeling—sometimes referred to as maximum a posteriori (MAP) inference—is, in general, NP-hard [12], due to the combinatorial structure of the problem. Many inference approximations

Operations research and optimization

Near-optimal disjoint-path facility location through set cover by pairs

David S. Johnson, Lee Breslau, Ilias Diakonikolas, Nick Duffield, Yu Gu, Mohammad Taghi Hajiaghayi, Howard Karloff, Mauricio G. C. Resende, Subhabrata Sen

arXiv

2016

In this paper we consider two special cases of the “cover-by-pairs” optimization problem that arise when we need to place facilities so that each customer is served by two facilities that reach it by disjoint shortest paths. These problems arise in a network traffic monitoring scheme proposed by Breslau et al. and have potential applications to content distribution. The “set-disjoint” variant applies to

Operations research and optimization

Scalability comparison scripts for deep learning frameworks

Vishaal Kapoor, Indu Thangakrishnan, Piyush Ghai, Frank Liu, Vandana Kannan, Jake Lee, Qing Lan, Suraj Kota, Anirudh Subramanian, Manu Seth, Andrew Ayres, Roshani Nagmote, Chaitanya Prakash Bapat, Anton Chernov, Dhanasekar Karuppasamy, Hao Jin, Rohit Srivastava, Sandeep Krishnamurthy, Amol Lele, Henri Yandell

2016

This repository contains scripts that compare the scalability of deep learning frameworks. The scripts train Inception v3 and AlexNet using synchronous stochastic gradient descent (SGD). To run the comparison in reasonable time, we run a few tens of iterations of SGD and compute the throughput as images processed per second. Comparisons can be done on clusters created with AWS CloudFormation using the Amazon

Machine learning
