Search - Amazon Science

Bounding the integrality distance of LP relaxations for structured prediction

Ben London, Ofer Meshi, Adrian Weller

NeurIPS 2016

2016

In structured prediction, a predictor optimizes an objective function over a combinatorial search space, such as the set of all image segmentations, or the set of all part-of-speech taggings. Unfortunately, finding the optimal structured labeling—sometimes referred to as maximum a posteriori (MAP) inference—is, in general, NP-hard [12], due to the combinatorial structure of the problem. Many inference approximations

Operations research and optimization

Near-optimal disjoint-path facility location through set cover by pairs

David S. Johnson, Lee Breslau, Ilias Diakonikolas, Nick Duffield, Yu Gu, Mohammad Taghi Hajiaghayi, Howard Karloff, Mauricio G. C. Resende, Subhabrata Sen

arXiv

2016

In this paper we consider two special cases of the “cover-by-pairs” optimization problem that arise when we need to place facilities so that each customer is served by two facilities that reach it by disjoint shortest paths. These problems arise in a network traffic monitoring scheme proposed by Breslau et al. and have potential applications to content distribution. The “set-disjoint” variant applies to

Operations research and optimization

Scalability comparison scripts for deep learning frameworks

Vishaal Kapoor, Indu Thangakrishnan, Piyush Ghai, Frank Liu, Vandana Kannan, Jake Lee, Qing Lan, Suraj Kota, Anirudh Subramanian, Manu Seth, Andrew Ayres, Roshani Nagmote, Chaitanya Prakash Bapat, Anton Chernov, Dhanasekar Karuppasamy, Hao Jin, Rohit Srivastava, Sandeep Krishnamurthy, Amol Lele, Henri Yandell

2016

This repository contains scripts that compares the scalability of deep learning frameworks. The scripts train Inception v3 and AlexNet using synchronous stochastic gradient descent (SGD). To run the comparison in reasonable time, we run few tens of iterations of SGD and compute the throughput as images processed per second. Comparisons can be done on clusters created with AWS CloudFormation using the Amazon

Machine learning

Amazon Redshift and the case for simpler data warehouses

Anurag Gupta, Deepak Agarwal, Derek Tan, Jakub Kulesza, Rahul Pathak, Stefano Stefani, Vidhya Srinivasan

ACM SIGMOD 2015

2015

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse solution that makes it simple and cost-effective to efficiently analyze large volumes of data using existing business intelligence tools. Since launching in February 2013, it has been Amazon Web Service’s (AWS) fastest growing service, with many thousands of customers and many petabytes of data under management. Amazon Redshift’s pace

Cloud and systems

Robust i-Vector Based Adaptation of DNN Acoustic Model for Speech Recognition

Sri Garimella, Arindam Mandal, Nikko Ström, Björn Hoffmeister, Spyros Matsoukas, Sree Hari Krishnan Parthasarathi

Interspeech 2015

2015

In the past, conventional i-vectors based on a Universal Background Model (UBM) have been successfully used as input features to adapt a Deep Neural Network (DNN) Acoustic Model (AM) for Automatic Speech Recognition (ASR). In contrast, this paper introduces Hidden Markov Model (HMM) based ivectors that use HMM state alignment information from an ASR system for estimating i-vectors. Further, we propose passing

Conversational AI

Accurate Endpointing with Expected Pause Duration

Baiyang Liu, Björn Hoffmeister, Ariya Rastrow

Interspeech 2015

2015

In an online automatic speech recognition system, the role of the endpoint detector is to infer when a user has finished speaking a query. Accurate and low-latency endpoint detection is crucial for natural voice interaction. Classic voice activity detector (VAD) based approaches monitor the incoming audio and trigger when a sufficiently long pause is detected. Such approaches are typically limited due to

Conversational AI

Scalable Distributed DNN Training Using Commodity GPU Cloud Computing

Nikko Ström

Interspeech 2015

2015

We introduce a new method for scaling up distributed Stochastic Gradient Descent (SGD) training of Deep Neural Networks (DNN). The method solves the well-known communication bottleneck problem that arises for data-parallel SGD because compute nodes frequently need to synchronize a replica of the model. We solve it by purposefully controlling the rate of weight-update per individual weight, which is in contrast

Cloud and systems

On challenges in machine learning model management

Sebastian Schelter, Felix Biessmann, Tim Januschowski, David Salinas, Stephan Seufert, Gyuri Szarvas

IEEE Data Engineering Bulletin

2015

The training, maintenance, deployment, monitoring, organization and documentation of machine learning (ML) models — in short, model management — is a critical task in virtually all production ML use cases. Wrong model management decisions can lead to poor performance of a ML system and result in high maintenance cost. As research on both infrastructure and algorithms is quickly evolving, there is a lack

Cloud and systems

fMLLR Based Feature-Space Speaker Adaptation of DNN Acoustic Models

Sree Hari Krishnan Parthasarathi, Björn Hoffmeister, Spyros Matsoukas, Arindam Mandal, Nikko Ström, Sri Garimella

Interspeech 2015

2015

We investigate the problem of speaker adaptation of DNN acoustic models in two settings: the traditional unsupervised adaptation and a supervised adaptation (SuA) where a few minutes of transcribed speech is available. SuA presents additional difficulties when a test speaker’s adaptation information does not match the registered speaker’s information. Employing feature-space maximum likelihood linear regression

Conversational AI

How Amazon Web Services uses formal methods

Chris Newcombe, Tim Rath, Fan Zhang, Bogdan Munteanu, Marc Brooker, Michael Deardeuff

Communications of the ACM

2015

Since 2011, ENGINEERS at Amazon Web Services (AWS) have used formal specification and model checking to help solve difficult design problems in critical systems. Here, we describe our motivation and experience, what has worked well in our problem domain, and what has not. When discussing personal experience we refer to the authors by their initials. At AWS we strive to build services that are simple for

Cloud and systems

Dynamo: Amazon’s highly available key-value store

Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, Werner Vogels

ACM Symposium on Operating System Principles

2007

Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components

Cloud and systems

Search results

Work with us