Search - Amazon Science

SegTree Transformer: Iterative refinement of hierarchical features

Zihao Ye, Qipeng Guo, Quan Gan, Zheng Zhang

ICLR 2019 Workshop on Representation Learning on Graphs and Manifolds

2019

The building block of Transformer can be seen as inducing message passing over a complete graph whose nodes correspond to input tokens. Such dense connections make the Transformer data-hungry. Star-Transformer exploits short-term dependencies more heavily by keeping the connections between adjacent tokens but relaying long dependencies via a central node, thereby reducing the number of connections from

Machine learning

Robust non-negative block sparse coding for acoustic novelty detection

Ritwik Giri, Arvindh Krishnaswamy, Karim Helwani

DCASE 2019

2019

In this paper we address the problem of detecting previously unseen novel audio events in the presence of real-life acoustic backgrounds. Specifically, during training, we learn subspaces corresponding to each acoustic background, and during testing the audio frame in question is decomposed into a component that lies on the mixture of subspaces and a super gaussian outlier component.Based on the energy

Conversational AI

The impact of big data on firm performance: An empirical investigation

Pat Bajari, Victor Chernozhukov, Ali Hortaçsu , Junichi Suzuki

AEA 2019

2019

We examine the impact of "big data" on firm performance in the context of forecast accuracy using proprietary retail sales data obtained from Amazon. We measure the accuracy of forecasts in two relevant dimensions: the number of products (N), and the number of time periods for which a product is available for sale (T). Theory suggests diminishing returns to larger N and T, with relative forecast errors

Economics

Multi-objective relevance ranking

Michinari Momma, Ali Bagheri Garakani, Yi Sun

SIGIR 2019 Workshop on e-Commerce

2019

In this paper, we introduce an Augmented Lagrangian based method in a search relevance ranking algorithm to incorporate the multi-dimensional nature of relevance and business constraints, both of which are the requirements for building relevance ranking models in production. The off-the-shelf solutions cannot handle such complex objectives and therefore, modelers are left hand-tuning of parameters that

Search and information retrieval

Further advances in open domain dialog systems in the Third Alexa Prize SocialBot Grand Challenge

Raefer Gabriel, Yang Liu, Anna Gottardi, Mihail Eric, Anju Khatri, Anjali Chadha, Qinlang Chen, Behnam Hedayatnia, Pankaj Rajan, Ali Binici, Shui Hu, Karthik Gopalakrishnan, Seokhwan Kim, Lauren Stubel, Kate Bland, Arindam Mandal, Dilek Hakkani-Tür

Alexa Prize SocialBot Grand Challenge 3 Proceedings

2019

Building open domain conversational systems that allow users to have engaging conversations on topics of their choice is a challenging task. The Alexa Prize Socialbot Grand Challenge was launched in 2016 to tackle the problem of achieving natural, sustained, coherent and engaging open-domain dialogs. In the third iteration of the competition, university teams have moved the needle on the state of the art

Conversational AI

A unified optimization approach for CNN model inference on integrated GPUs

Leyuan Wang, Zhi Chen, Yizhi Liu, Yao Wang, Lianmin Zheng, Mu Li, Yida Wang

ICPP 2019

2019

Modern deep learning applications urge to push the model inference taking place at the edge devices for multiple reasons such as achieving shorter latency, relieving the burden of the network connecting to the cloud, and protecting user privacy. The Convolutional Neural Network (CNN) is one of the most widely used model family in the applications. Given the high computational complexity of the CNN models

Cloud and systems

Multi-domain goal-oriented dialogues (MultiDoGO): Strategies toward curating and annotating large scale dialogue data

Denis Peskov, Nancy Clarke, Jason Krone, Brigi Fodor, Yi Zhang, Adel Youssef, Mona Diab

2019

The need for high-quality, large-scale, goal-oriented dialogue datasets continues to grow as virtual assistants become increasingly widespread. However, existing publicly available datasets useful for this area are limited either in their size, linguistic diversity, domain coverage, or annotation granularity. We introduce the MultiDoGO dataset to overcome these limitations. With a total of over 65,000 dialogues

Conversational AI

Task2Vec

Alessandro Achille, Michael Lam, Rahul Tewari, Avinash Ravichandran, Subhranshu Maji, Charless Fowlkes, Stefano Soatto, Pietro Perona

2019

We introduce a method to generate vectorial representations of visual classification tasks that can be used to reason about the nature of those tasks and their relations. Given a dataset with ground-truth labels and a loss function, we process images through a “probe network” and compute an embedding based on estimates of the Fisher information matrix associated with the probe network parameters. This provides

Computer vision

OR Rl benchmarks

Bharathan Balaji, Jordan Bell-Masterson , Andreas Damianou, Pablo Garcia Moreno, Runfei Luo, Alvaro Maggiar, Balakrishnan (Murali) Narayanaswamy, Chun Ye

2019

Reinforcement Learning (RL) has achieved state-of-the-art results in domains such as robotics and games. We build on this previous work by applying RL algorithms to a selection of canonical online stochastic optimization problems with a range of practical applications: Bin Packing, Newsvendor, and Vehicle Routing. While there is a nascent literature that applies RL to these problems, there are no commonly

Machine learning

Contextual Query Rewrite (CQR) Dataset for spoken dialogue

Pushpendre Rastogi, Arpit Gupta, Tongfei Chen, Lambert Mathias

2019

Dialogue assistants are used by millions of people today to fulfill a variety of tasks. Such assistants also serve as a digital marketplace where any developer can build a domain-specific, task-oriented, dialogue agent offering a service such as booking cabs, ordering food, listening to music, shopping etc. Also, these agents may interact with each other, when completing a task on behalf of the user. Accomplishing

Conversational AI

Topic modeling with Wasserstein autoencoders

Feng Nan, Ran Ding, Ramesh Nallapati, Bing Xiang

2019

We propose a novel neural topic model in the Wasserstein autoencoders (WAE) framework. Unlike existing variational autoencoder based models, we directly enforce Dirichlet prior on the latent document-topic vectors. We exploit the structure of the latent space and apply a suitable kernel in minimizing the Maximum Mean Discrepancy (MMD) to perform distribution matching. We discover that MMD performs much

Conversational AI

Amazon SageMaker Debugger

Nathalie Rauschmayr, Vikas Kumar, Rahul Huilgol, Andrea Olgiati, Satadal Bhattacharjee, Nihal Harish, Vandana Kannan, Amol Lele, Anirudh Acharya, Jared Nielsen, Lakshmi Ramakrishnan, Ishaaq Chandy, Ishan Bhatt, Zhihan Li, Kohen Chia, Neelesh Dodda, Jiacheng Gu, Miyoung Choi, Balajee Nagarajan, Jeffrey Geevarghes, Denis Davydenko, Sifei Li, Lu Huang, Edward Kim, Tyler Hill, Krishnaram Kenthapadi

2019

Amazon SageMaker Debugger automates the debugging process of machine learning training jobs. From training jobs, Debugger allows you to run your own training script (Zero Script Change experience) using Debugger built-in features—Hook and Rule—to capture tensors, have flexibility to build customized Hooks and Rules for configuring tensors as you want, and make the tensors available for analysis by saving

Machine learning

MLIO

Can Balioglu, Rizwan Gilani

2019

MLIO is a high performance data access library for machine learning tasks with support for multiple data formats. It makes it easy for scientists to train models on their data without worrying about the format or where it's stored. Algorithm developers can also use MLIO to build production-quality algorithms that support a rich variety of data formats and provide helpful parsing and validation messages

Machine learning

Joint biased embeddings

Esma Balkir, Masha Naslidnyk, Dave Palfrey, Arpit Mittal, Sophie Durrant

2019

In this paper we study techniques to improve the performance of bilinear embedding methods for knowledge graph completion on large datasets, where at each epoch the model sees a very small percentage of the training data, and the number of generated negative examples for each positive example is limited to a small portion of the entire set of entities. We first present a heuristic method to infer the types

Machine learning

GluonTS: Probabilistic time series models in Python

Valentin Flunkert, Alexander Alexandrov, Jasper Zschiegner, Jan Gasthaus, David Salinas, Danielle Maddix Robinson, Yuyang (Bernie) Wang, Syama Rangapuram, Lorenzo Stella, Michael Bohlke-Schneider, Tim Januschowski

2019

We introduce Gluon Time Series (GluonTS)1, a library for deep-learning-based time series modeling. GluonTS simplifies the development of and experimentation with time series models for common tasks such as forecasting or anomaly detection. It provides all necessary components and tools that scientists need for quickly building new models, for efficiently running and analyzing experiments and for evaluating

Cloud and systems

Amazon Sustainability Data Initiative

David Roberts, Peter Schmiedeskamp, Steve Gillard, Erin Chu, Chris Stoner

2019

The Amazon Sustainability Data Initiative (ASDI) seeks to accelerate sustainability research and innovation by minimizing the cost and time required to acquire and analyze large sustainability datasets. ASDI supports innovators and researchers with the data, tools, and technical expertise they need to move sustainability to the next level. This repo contains docs, examples, and supporting material for ASDI

Sustainability

Micro-HTTP

Andreea Florescu, Jiang Liu, Luminita Voicu, Alexandru Cihodaru, Sebastien Boeuf, Adrian Costin Catangiu, George Pisaltu, Damien Stanton, Jonathan Woollett-Light, William Douglas, Alexandra Iordache, Ioana Chirca, Eisuke Matsushita, Tim Visée, Laura Loghin, Keyang Xie, Karthik Nedunchezhiyan, Bob Potter, Changwei Ge

2019

This is a minimal implementation of the HTTP/1.0 and HTTP/1.1 protocols. This HTTP implementation is stateless thus it does not support chunking or compression. The micro-http implementation is used in production by Firecracker. As micro-http uses std::os::unix this crates only supports Unix-like targets.

Cloud and systems

Topical-Chat

Karthik Gopalakrishnan, Behnam Hedayatnia, Qinlang Chen, Anna Gottardi, Sanjeev Kwatra, Anushree Venkatesh, Raefer Gabriel, Dilek Hakkani-Tür

2019

Building socialbots that can have deep, engaging open-domain conversations with humans is one of the grand challenges of artificial intelligence (AI). To this end, bots need to be able to leverage world knowledge spanning several domains effectively when conversing with humans who have their own world knowledge. Existing knowledge-grounded conversation datasets are primarily stylized with explicit roles

Conversational AI

RAMEN

Ke Tran

2019

Pre-trained models have demonstrated their effectiveness in many downstream natural language processing (NLP) tasks. The availability of multilingual pre-trained models enables zero-shot transfer of NLP tasks from high resource languages to low resource ones. However, recent research in improving pre-trained models focuses heavily on English. While it is possible to train the latest neural architectures

Conversational AI

FEVER 2.0 (2019)

James Thorne, Andreas Vlachos, Oana Cocarascu, Christos Christodoulopoulos, Arpit Mittal

2019

We present the results of the second Fact Extraction and VERification (FEVER2.0) Shared Task. The task challenged participants to both build systems to verify factoid claims using evidence retrieved from Wikipedia and to generate adversarial attacks against other participant’s systems. The shared task had three phases: building, breaking and fixing. There were 8 systems in the builder’s round, three of

Conversational AI

Search results

Work with us