Search - Amazon Science

Learning when not to answer: A ternary reward structure for reinforcement learning based question answering

Frederic Godin, Anjishnu Kumar, Arpit Mittal

NAACL 2019, NeurIPS 2018

2018

In this paper, we investigate the challenges of using reinforcement learning agents for question-answering over knowledge graphs for real-world applications. We examine the performance metrics used by state-of-the-art systems and determine that they are inadequate for such settings. More specifically, they do not evaluate the systems correctly for situations when there is no answer available and thus agents

Machine learning

Deep Gaussian processes for multi-fidelity modeling

Kurt Cutajar, Mark Pullin, Andreas Damianou, Javier González, Neil Lawrence

NeurIPS 2018

2018

Multi-fidelity methods are prominently used when cheaply-obtained, but possibly biased and noisy, observations must be effectively combined with limited or expensive true data in order to construct reliable models. This arises in both fundamental machine learning procedures such as Bayesian optimization, as well as more practical science and engineering applications. In this paper we develop a novel multi-fidelity

Machine learning

Invariant representation learning for robust deep networks

Julian Salazar, Davis Liang, Zhiheng Huang, Zachary Lipton

NeurIPS 2018

2018

Deep neural networks are often brittle to superficial perturbations of their inputs; models that perform well offline on held-out data can still break under small amounts of naturally-occurring or adversarial shifts. We consider invariant representation learning (IRL), first proposed in the domain of speech recognition, as a simple, effective, and general extension to data augmentation. Rather than only

Machine learning

The Alexa Meaning Representation Language

Thomas Kollar, Danielle Berry, Lauren Stuart, Karolina Owczarzak, Tagyoung Chung, Lambert Mathias, Michael Kayser, Bradford Snow, Spyros Matsoukas

NAACL 2018

2018

This paper introduces a meaning representation for spoken language understanding. The Alexa meaning representation language (AMRL), unlike previous approaches, which factor spoken utterances into domains, provides a common representation for how people communicate in spoken language. AMRL is a rooted graph, links to a large-scale ontology, supports cross-domain queries, finegrained types, complex utterances

Conversational AI

DeClarE: Debunking fake news and false claims using evidence-aware deep learning

Kashyap Popat, Subhabrata Mukherjee, Andrew Yates, Gerhard Weikum

ACL 2018

2018

Misinformation such as fake news is one of the big challenges of our society. Research on automated fact-checking has proposed methods based on supervised learning, but these approaches do not consider external evidence apart from labeled training instances. Recent approaches counter this deficit by considering external sources related to a claim. However, these methods require substantial feature modeling

Conversational AI

LinkNBed: Multi-graph representation learning with entity linkage

Rakshit Trivedi, Bunyamin Sisman, Xin Luna Dong, Jun Ma, Christos Faloutsos

ACL 2018

2018

Knowledge graphs have emerged as an important model for studying complex multi-relational data. This has given rise to the construction of numerous large scale but incomplete knowledge graphs encoding information extracted from various resources. An effective and scalable approach to jointly learn over multiple graphs and eventually construct a unified graph is a crucial next step for the success of knowledge-based

Conversational AI

Device-directed Utterance Detection

Sri Harish Mallidi, Roland Maas, Spyros Matsoukas, Björn Hoffmeister

Interspeech 2018

2018

In this work, we propose a classifier for distinguishing device-directed queries from background speech in the context of interactions with voice assistants. Applications include rejection of false wake-ups or unintended interactions as well as enabling wake-word free followup queries. Consider the example interaction: “Computer, play music”, “Computer, reduce the volume”. In this interaction, the user

Conversational AI

A neural interlingua for multilingual machine translation

Yichao Lu, Phillip Keung, Faisal Ladhak, Shaonan Zhang, Vikas Bhardwaj, Jason Sun

ACL 2018

2018

We incorporate an explicit neural interlingua into a multilingual encoder-decoder neural machine translation (NMT) architecture. We demonstrate that our model learns a language-independent representation by performing direct zero-shot translation (without using pivot translation), and by using the source sentence embeddings to create an English Yelp review classifier that, through the mediation of the neural

Conversational AI

Automatic stance detection using end-to-end memory networks

Mitra Mohtarami, Ramy Baly, James Glass, Preslav Nakov, Lluís Marquez, Alessandro Moschitti

NAACL 2018

2018

We present an effective end-to-end memory network model that jointly (i) predicts whether a given document can be considered as relevant evidence for a given claim, and (ii) extracts snippets of evidence that can be used to reason about the factuality of the target claim. Our model combines the advantages of convolutional and recurrent neural networks as part of a memory network. We further introduce a

Conversational AI

Coherence-aware topic modeling

Ran Ding, Ramesh Nallapati, Bing Xiang

EMNLP 2018

2018

Topic models are evaluated based on their ability to describe documents well (i.e. low perplexity) and to produce topics that carry coherent semantic meaning. In topic modeling so far, perplexity is a direct optimization target. However, topic coherence, owing to its challenging computation, is not optimized for and is only evaluated after training. In this work, under a neural variational inference framework

Conversational AI

FEVER: a large-scale dataset for Fact Extraction and VERification

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit Mittal

NAACL 2018

2018

In this paper we introduce a new publicly available dataset for verification against textual sources, FEVER: Fact Extraction and VERification. It consists of 185,445 claims generated by altering sentences extracted from Wikipedia and subsequently verified without knowledge of the sentence they were derived from. The claims are classified as SUPPORTED, REFUTED or NOTENOUGHINFO by annotators achieving 0.6841

Conversational AI

A Scalable Neural Shortlisting-Reranking Approach for Large-Scale Domain Classification in Natural Language Understanding

Young-Bum Kim, Dongchan Kim, Joo-Kyung Kim, Ruhi Sarikaya

NAACL 2018

2018

Intelligent personal digital assistants (IPDAs), a popular real-life application with spoken language understanding capabilities, can cover potentially thousands of overlapping domains for natural language understanding, and the task of finding the best domain to handle an utterance becomes a challenging problem on a large scale.

Conversational AI

Improved knowledge graph embeddings by using inferred entity types

Esma Balkir, Masha Naslidnyk, Dave Palfrey, Arpit Mittal, Sophie Durrant

NeurIPS 2018

2018

In this paper we study techniques to improve the performance of bilinear embedding methods for knowledge graph completion on large datasets, where at each epoch the model sees a very small percentage of the training data, and the number of generated negative examples for each positive example is limited to a small portion of the entire set of entities. We first present a heuristic method to infer the types

Machine learning

Contextual topic modeling for dialogue systems

Chandra Khatri, Rahul Goel, Behnam Hedayatnia, Angeliki Metallinou, Raefer Gabriel, Arindam Mandal

SLT 2018

2018

Accurate prediction of conversation topics can be a valuable signal for creating coherent and engaging dialog systems. In this work, we focus on context-aware topic classification methods for identifying topics in free-form human-chatbot dialogs. We extend previous work on neural topic classification and unsupervised topic keyword detection by incorporating conversational context and dialog act features

Conversational AI

On acquisition functions for active multi-source Bayesian quadrature

Alexandra Gessner, Maren Mahsereci, Javier González

NeurIPS 2018, UAI 2019

2018

Bayesian quadrature (BQ) is a sample efficient probabilistic numerical method to solve integrals of expensive-to-evaluate black-box functions, yet so far, active BQ learning schemes focus merely on the integrand itself as information source, and do not allow for information transfer from cheaper, related functions. Here, we set the scene for active learning in BQ when multiple related information sources

Machine learning

Simple large-scale relationextraction from unstructured text

Christos Christodoulopoulos, Arpit Mittal

LREC 2018

2018

Knowledge-based question answering relies on the availability of facts, the majority of which cannot be found in structured sources (e.g. Wikipedia info-boxes, Wikidata). One of the major components of extracting facts from unstructured text is Relation Extraction (RE). In this paper we propose a novel method for creating distant (weak) supervision labels for training a large-scale RE system. We also provide

Conversational AI

The Fact Extraction and VERification (FEVER) Shared Task

James Thorne, Andreas Vlachos, Oana Cocarascu, Christos Christodoulopoulos, Arpit Mittal

EMNLP 2018

2018

We present the results of the first Fact Extraction and VERification (FEVER) Shared Task. The task challenged participants to classify whether human-written factoid claims could be SUPPORTED or REFUTED using evidence retrieved from Wikipedia. We received entries from 23 competing teams, 19 of which scored higher than the previously published baseline. The best performing system achieved a FEVER score of

Conversational AI

Integrating stance detection and fact checking in a unified corpus

Ramy Baly, Mitra Mohtarami, James Glass, Lluís Marquez, Alessandro Moschitti, Preslav Nakov

NAACL 2018

2018

A reasonable approach for fact checking a claim involves retrieving potentially relevant documents from different sources (e.g., news websites, social media, etc.), determining the stance of each document with respect to the claim, and finally making a prediction about the claim’s factuality by aggregating the strength of the stances, while taking the reliability of the source into account. Moreover, a

Conversational AI

Supervised Domain Enablement Attention for Personalized Domain Classification

Joo-Kyung Kim, Young-Bum Kim

EMNLP 2018

2018

In large-scale domain classification for natural language understanding, leveraging each user’s domain enablement information, which refers to the preferred or authenticated domains by the user, with attention mechanism has been shown to improve the overall domain classification performance. In this paper, we propose a supervised enablement attention mechanism, which utilizes sigmoid activation for the

Conversational AI

A call for clarity in reporting BLEU Scores

Matt Post

WMT 2018

2018

The field of machine translation faces an under-recognized problem because of inconsistency in the reporting of scores from its dominant metric. Although people refer to “the” BLEU score, BLEU is in fact a parameterized metric whose values can vary wildly with changes to these parameters. These parameters are often not reported or are hard to find, and consequently, BLEU scores between papers cannot be

Conversational AI

Search results

Work with us