Search - Amazon Science

18,803 results found

Domain adaptation with BERT-based domain classification and data selection

Xiaofei Ma, Zhiguo Wang, Ramesh Nallapati, Bing Xiang

EMNLP 2019 Workshop on DeepLo

2019

The performance of deep neural models can deteriorate substantially when there is a domain shift between training and test data. For example, the pre-trained BERT model can be easily fine-tuned with just one additional output layer to create a state-of-the-art model for a wide range of tasks. However, the fine-tuned BERT model suffers considerably at zero-shot when applied to a different domain. In this

Conversational AI
Improving answer selection and answer triggering using hard negatives

Sawan Kumar, Shweta Garg, Kartik Mehta, Nikhil Rasiwasia

EMNLP 2019

2019

In this paper, we establish the effectiveness of using hard negatives, coupled with a siamese network and a suitable loss function, for the tasks of answer selection and answer triggering. We show that the choice of sampling strategy is key for achieving improved performance on these tasks. Evaluating on recent answer selection datasets -- InsuranceQA, SelQA, and an internal QA dataset, we show that using

Conversational AI
Efficient convolutional neural network for diacritic restoration

Sawsan Alqahtani, Ajay Mishra, Mona Diab

EMNLP 2019

2019

Diacritic restoration has gained importance with the growing need for machines to understand written texts. The task is typically modeled as a sequence labeling problem and currently Bidirectional Long Short Term Memory (BiLSTM) models provide state-of-the-art results. Recently, Bai et al. (2018) show the advantages of Temporal Convolutional Neural Networks (TCN) over Recurrent Neural Networks (RNN) for

Conversational AI
Improved color modeling in different color spaces

Xudong Han, Trevor Cohn, Philip Schulz

EMNLP 2019

2019

We present a model that grounds color comparative adjectives in 2 different color spaces. We find that modifiers represented as vectors from reference colors to target colors show different behaviors in RGB and HSV color space. Based on this finding we design models that primarily improve modeling of color related modifiers, such as "pinkish". In experiments, we pre-train basic models in different color spaces

Cloud and systems
Dialog State Tracking: A Neural Reading Comprehension approach

Shuyang Gao, Abhishek Sethi, Sanchit Agarwal, Tagyoung Chung, Dilek Hakkani-Tür

SIGDIAL 2019

2019

Dialog state tracking is used to estimate the current belief state of a dialog given all the preceding conversation. Machine reading comprehension, on the other hand, focuses on building systems that read passages of text and answer questions that require some understanding of passages. We formulate dialog state tracking as a reading comprehension task to answer the question what is the state of the current

Related: Turning Dialogue Tracking into a Reading Comprehension Problem

Conversational AI
Topical-Chat: Towards knowledge-grounded open-domain conversations

Karthik Gopalakrishnan, Behnam Hedayatnia, Qinlang Chen, Anna Gottardi, Sanjeev Kwatra, Anushree Venkatesh, Raefer Gabriel, Dilek Hakkani-Tür

Interspeech 2019

2019

Building socialbots that can have deep, engaging open-domain conversations with humans is one of the grand challenges of artificial intelligence (AI). To this end, bots need to be able to leverage world knowledge spanning several domains effectively when conversing with humans who have their own world knowledge. Existing knowledge-grounded conversation datasets are primarily stylized with explicit roles

Related: Amazon releases data set of annotated conversations to aid development of socialbots

Conversational AI
End-to-end benchmarking of deep learning platforms

Vincent Deuschle, Alexander Alexandrov, Tim Januschowski

VLDB Technology Conference on Performance Evaluation and Benchmarking 2019

2019

With their capability to recognize complex patterns in data, deep learning models are rapidly becoming the most prominent set of tools for a broad range of data science tasks from image classification to natural language processing. This trend is supplemented by the availability of deep learning software platforms and modern hardware environments. We propose a declarative benchmarking framework to evaluate

Cloud and systems
HyST: A Hybrid Approach for Flexible and Accurate Dialogue State Tracking

Rahul Goel, Shachi Paul, Dilek Hakkani-Tür

Interspeech 2019

2019

Recent works on end-to-end trainable neural network based approaches have demonstrated state-of-the-art results on dialogue state tracking. The best performing approaches estimate a probability distribution over all possible slot values. However, these approaches do not scale for large value sets commonly present in real-life applications and are not ideal for tracking slot values that were not observed

Related: New Alexa Research on Task-Oriented Dialogue Systems

Conversational AI
Meta-Surrogate Benchmarking for Hyperparameter Optimization

Aaron Klein, Javier González, Zhenwen Dai, Frank Hutter, Neil Lawrence

NeurIPS 2019

2019

Despite the recent progress in hyperparameter optimization (HPO), available benchmarks that resemble real-world scenarios usually consist of a few very large problem instances that are expensive to solve. This blocks researchers and practitioners from systematically running large-scale comparisons that are needed to draw statistically significant results. This work proposes a method to alleviate these

Cloud and systems
SIR beam selector for Amazon Echo devices audio front-end

Xianxian Zhang, Trausti Kristjansson, Philip Hilmes

SiPS 2019

2019

The Audio Front-End (AFE) is a key component in mitigating acoustic environmental challenges for far-field automatic speech recognition (ASR) on the Amazon Echo family of products. A critical component of the AFE is the Beam Selector, which identifies which beam points to the target user. In this paper, we propose a new SIR beam selector that utilizes subband-based signal-to-interference ratios to learn the

Conversational AI
AutoAssist: A framework to accelerate training of deep neural networks

Jiong Zhong, Hsiang-Fu Yu, Inderjit S. Dhillon

NeurIPS 2019

2019

Deep Neural Networks (DNNs) have yielded superior performance in many contemporary applications. However, the gradient computation in a deep model with millions of instances leads to a lengthy training process even with modern GPU/TPU hardware acceleration. In this paper, we propose AutoAssist, a simple framework to accelerate training of a deep neural network. Typically, as the training procedure evolves

Cloud and systems
Video story question answering with character-centric scene parsing and question-aware temporal attention

Shijie Geng, Ji Zhang, Zuohui Fu, Hang Zhang, Ahmed Elgammal, Gerard de Melo, Dimitris Metaxas

arXiv

2019

With the explosive growth of videos, there is increasing interest in automatic video understanding. Video Story Question Answering (VSQA) proves to be an effective way of benchmarking the comprehension ability of a model. Recent VSQA approaches merely extract visual features from the whole scene or detected objects in each frame. However, it is hard to claim a method really understands a video without

Computer vision