Search - Amazon Science

Unsupervised induction of linguistic categories with records of reading, speaking, and writing

Maria Barrett, Lea Frermann, Ana Valeria Gonzalez-Garduño, Anders Søgaard

NAACL 2018

2018

When learning POS taggers and syntactic chunkers for low-resource languages, different resources may be available, and often all we have is a small tag dictionary, motivating type-constrained unsupervised induction. Even small dictionaries can improve the performance of unsupervised induction algorithms. This paper shows that performance can be further improved by including data that is readily available

Conversational AI

Automatic stance detection using end-to-end memory networks

Mitra Mohtarami, Ramy Baly, James Glass, Preslav Nakov, Lluís Marquez, Alessandro Moschitti

NAACL 2018

2018

We present an effective end-to-end memory network model that jointly (i) predicts whether a given document can be considered as relevant evidence for a given claim, and (ii) extracts snippets of evidence that can be used to reason about the factuality of the target claim. Our model combines the advantages of convolutional and recurrent neural networks as part of a memory network. We further introduce a

Conversational AI

The Alexa Meaning Representation Language

Thomas Kollar, Danielle Berry, Lauren Stuart, Karolina Owczarzak, Tagyoung Chung, Lambert Mathias, Michael Kayser, Bradford Snow, Spyros Matsoukas

NAACL 2018

2018

This paper introduces a meaning representation for spoken language understanding. The Alexa meaning representation language (AMRL), unlike previous approaches, which factor spoken utterances into domains, provides a common representation for how people communicate in spoken language. AMRL is a rooted graph, links to a large-scale ontology, supports cross-domain queries, finegrained types, complex utterances

Conversational AI

DeClarE: Debunking fake news and false claims using evidence-aware deep learning

Kashyap Popat, Subhabrata Mukherjee, Andrew Yates, Gerhard Weikum

ACL 2018

2018

Misinformation such as fake news is one of the big challenges of our society. Research on automated fact-checking has proposed methods based on supervised learning, but these approaches do not consider external evidence apart from labeled training instances. Recent approaches counter this deficit by considering external sources related to a claim. However, these methods require substantial feature modeling

Conversational AI

A neural interlingua for multilingual machine translation

Yichao Lu, Phillip Keung, Faisal Ladhak, Shaonan Zhang, Vikas Bhardwaj, Jason Sun

ACL 2018

2018

We incorporate an explicit neural interlingua into a multilingual encoder-decoder neural machine translation (NMT) architecture. We demonstrate that our model learns a language-independent representation by performing direct zero-shot translation (without using pivot translation), and by using the source sentence embeddings to create an English Yelp review classifier that, through the mediation of the neural

Conversational AI

LinkNBed: Multi-graph representation learning with entity linkage

Rakshit Trivedi, Bunyamin Sisman, Xin Luna Dong, Jun Ma, Christos Faloutsos

ACL 2018

2018

Knowledge graphs have emerged as an important model for studying complex multi-relational data. This has given rise to the construction of numerous large scale but incomplete knowledge graphs encoding information extracted from various resources. An effective and scalable approach to jointly learn over multiple graphs and eventually construct a unified graph is a crucial next step for the success of knowledge-based

Conversational AI

Device-directed Utterance Detection

Sri Harish Mallidi, Roland Maas, Spyros Matsoukas, Björn Hoffmeister

Interspeech 2018

2018

In this work, we propose a classifier for distinguishing device-directed queries from background speech in the context of interactions with voice assistants. Applications include rejection of false wake-ups or unintended interactions as well as enabling wake-word free followup queries. Consider the example interaction: “Computer, play music”, “Computer, reduce the volume”. In this interaction, the user

Conversational AI

Coherence-aware topic modeling

Ran Ding, Ramesh Nallapati, Bing Xiang

EMNLP 2018

2018

Topic models are evaluated based on their ability to describe documents well (i.e. low perplexity) and to produce topics that carry coherent semantic meaning. In topic modeling so far, perplexity is a direct optimization target. However, topic coherence, owing to its challenging computation, is not optimized for and is only evaluated after training. In this work, under a neural variational inference framework

Conversational AI

CogCompNLP: Your Swiss army knife for NLP

Daniel Khashabi, Mark Sammons, Christos Christodoulopoulos, Bhargav Mangipudi, Tom Redman, Ben Zhou, Guanheng Luo, Shaoshi Ling, Dan Roth

LREC 2018

2018

Implementing a Natural Language Processing (NLP) system requires considerable engineering effort: creating data-structures to represent language constructs; reading corpora annotations into these data-structures; applying off-the-shelf NLP tools to augment the text representation; extracting features and training machine learning components; conducting experiments and computing performance statistics; and

Conversational AI

FEVER: a large-scale dataset for Fact Extraction and VERification

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit Mittal

NAACL 2018

2018

In this paper we introduce a new publicly available dataset for verification against textual sources, FEVER: Fact Extraction and VERification. It consists of 185,445 claims generated by altering sentences extracted from Wikipedia and subsequently verified without knowledge of the sentence they were derived from. The claims are classified as SUPPORTED, REFUTED or NOTENOUGHINFO by annotators achieving 0.6841

Conversational AI

On acquisition functions for active multi-source Bayesian quadrature

Alexandra Gessner, Maren Mahsereci, Javier González

NeurIPS 2018, UAI 2019

2018

Bayesian quadrature (BQ) is a sample efficient probabilistic numerical method to solve integrals of expensive-to-evaluate black-box functions, yet so far, active BQ learning schemes focus merely on the integrand itself as information source, and do not allow for information transfer from cheaper, related functions. Here, we set the scene for active learning in BQ when multiple related information sources

Machine learning

Learning to segment inputs for NMT favors character-level processing

Julia Kreutzer, Artem Sokolov

IWSLT 2018

2018

Most modern neural machine translation (NMT) systems rely on presegmented inputs. Segmentation granularity importantly determines the input and output sequence lengths, hence the modeling depth, and source and target vocabularies, which in turn determine model size, computational costs of softmax normalization, and handling of out-of-vocabulary words. However, the current practice is to use static, heuristic-based

Conversational AI

MLZero: Towards zero touch machine learning

Tom Diethe, Tom Borchert, Eno Thereska, Borja de Balle Pigem, Cédric Archambeau, Neil Lawrence

NeurIPS 2018

2018

This paper describes a reference architecture for self-maintaining systems that can learn continually, as data arrives. In environments where data evolves, we need architectures that manage Machine Learning (ML) models in production, adapt to shifting data distributions, cope with outliers, retrain when necessary, and adapt to new tasks. This represents continual AutoML or Automatically Adaptive Machine

Cloud and systems

Unsupervised quality estimation without reference corpus for subtitle machine translation using word embeddings

Prabhakar Gupta, Shaktisingh Shekhawat, Keshav Kumar

ICSC 2018

2018

We demonstrate the potential for using aligned bilingual word embeddings to create an unsupervised method to evaluate machine translations without a need for a parallel translation corpus or reference corpus. We explain why movie subtitles differ from other text and share our experimental results conducted on them for four target languages (French, German, Portuguese and Spanish) with English-source subtitles

Conversational AI

Learning when not to answer: A ternary reward structure for reinforcement learning based question answering

Frederic Godin, Anjishnu Kumar, Arpit Mittal

NAACL 2019, NeurIPS 2018

2018

In this paper, we investigate the challenges of using reinforcement learning agents for question-answering over knowledge graphs for real-world applications. We examine the performance metrics used by state-of-the-art systems and determine that they are inadequate for such settings. More specifically, they do not evaluate the systems correctly for situations when there is no answer available and thus agents

Machine learning

Invariant representation learning for robust deep networks

Julian Salazar, Davis Liang, Zhiheng Huang, Zachary Lipton

NeurIPS 2018

2018

Deep neural networks are often brittle to superficial perturbations of their inputs; models that perform well offline on held-out data can still break under small amounts of naturally-occurring or adversarial shifts. We consider invariant representation learning (IRL), first proposed in the domain of speech recognition, as a simple, effective, and general extension to data augmentation. Rather than only

Machine learning

Deep Gaussian processes for multi-fidelity modeling

Kurt Cutajar, Mark Pullin, Andreas Damianou, Javier González, Neil Lawrence

NeurIPS 2018

2018

Multi-fidelity methods are prominently used when cheaply-obtained, but possibly biased and noisy, observations must be effectively combined with limited or expensive true data in order to construct reliable models. This arises in both fundamental machine learning procedures such as Bayesian optimization, as well as more practical science and engineering applications. In this paper we develop a novel multi-fidelity

Machine learning

Multiplicative tree-structured long short-term memory networks for semantic representations

Nam Khanh Tran, Weiwei Cheng

NAACL 2018

2018

Tree-structured LSTMs have shown advantages in learning semantic representations by exploiting syntactic information. Most existing methods model tree structures by bottomup combinations of constituent nodes using the same shared compositional function and often making use of input word information only. The inability to capture the richness of compositionality makes these models lack expressive power.

Conversational AI

A multi-objective rule optimizer with an application to risk management

Pietari Pulkkinen, Neetesh Tiwari, Akhil Kumar, Christopher Jones, Yan Zhang

ICMLA 2018

2018

Managing risk is important to any E-commerce merchant. Various machine learning (ML) models combined with a rule set as the decision layer is a common practice to manage the risks. Unlike the ML models that can be automatically refreshed periodically based on new risk patterns, rules are generally static and rely on manual updates. To tackle that, this paper presents a data-driven and automated rule optimization

Operations research and optimization

Effect of Data Reduction on Sequence-to-Sequence Neural TTS

Javier Latorre, Jakub Lachowicz, Jaime Lorenzo Trueba, Tom Merritt, Thomas Drugman, Srikanth Ronanki, Viacheslav Klimkov

ICASSP 2019

2018

Recent speech synthesis systems based on sampling from autoregressive neural networks models can generate speech almost undistinguishable from human recordings. However, these models require large amounts of data. This paper shows that the lack of data from one speaker can be compensated with data from other speakers. The naturalness of Tacotron2-like models trained on a blend of 5k utterances from 7 speakers

Conversational AI

Search results

Work with us