Search - Amazon Science

Salience rank: Efficient keyphrase extraction with topic modeling

ACL 2017

2017

Topical PageRank (TPR) uses latent topic distribution inferred by Latent Dirichlet Allocation (LDA) to perform ranking of noun phrases extracted from documents. The ranking procedure consists of running PageRank K times, where K is the number of topics used in the LDA model. In this paper, we propose a modification of TPR, called Salience Rank. Salience Rank only needs to run PageRank once and extracts

Conversational AI

Toward better reconstruction of style images with GANs

Alexander Lorbert, Nir Ben-Zvi, Arridhana Ciptadi, Eduard Oks, Ambrish Tyagi

KDD 2017

2017

Generative adversarial networks (GANs) have recently seen a surge of interest in learning an inverse mapping—projecting the data back to the latent space. This learned mapping allows for image reconstruction—encoding and decoding—from a compact latent space. The choice of loss function(s) in this framework plays a critical role in determining the quality of the reconstruction. In this paper we explore possible

Machine learning

Ranking and calibrating click-attributed purchases in performance display advertising

Sougata Chaudhuri, Abraham Bagherjeiran, James Liu

KDD 2017

2017

In performance display advertising, bidders compete on behalf of advertisers for ad impressions, that is, the opportunity to display relevant ads on a publisher website. We consider bidding on behalf of online retailers who buy ad impressions hoping to realize value only from purchases attributed from clicks. The bidder has a two stage problem. In the first stage, the bidder has to select a small subset

Machine learning

Recommending product sizes to customers

Vivek Sembium, Rajeev Rastogi, Atul Saroop, Srujana Merugu

RecSys 2017

2017

We propose a novel latent factor model for recommending product size fits {Small, Fit, Large} to customers. Latent factors for customers and products in our model correspond to their physical true size, and are learnt from past product purchase and returns data. The outcome for a customer, product pair is predicted based on the difference between customer and product true sizes, and efficient algorithms

Search and information retrieval

Two decades of recommender systems at Amazon.com

Brent Smith, Greg Linden

IEEE Internet Computing

2017

Amazon is well-known for personalization and recommendations, which help customers discover items they might otherwise not have found. In this update to our original article, we discuss some of the changes as Amazon has grown.

Search and information retrieval

Sustainability at scale: Towards bridging the intention-behavior gap with sustainable recommendations

Sabina Tomkins, Steven Isley, Ben London, Lise Getoor

RecSys 2017

2017

Finding sustainable products and evaluating their claims is a significant barrier facing sustainability-minded customers. Tools that reduce both these burdens are likely to boost the sale of sustainable products. However, it is difficult to determine the sustainability characteristics of these products — there are a variety of certifications and definitions of sustainability, and quality labeling requires

Search and information retrieval

A Shared Task on Bandit Learning for Machine Translation

Artem Sokolov, Julia Kreutzer, Kellen Sunderland, Pavel Danchenko, Witold Szymaniak, Hagen Fuerstenau, Stefan Riezler

WMT 2017

2017

We introduce and describe the results of a novel shared task on bandit learning for machine translation. The task was organized jointly by Amazon and Heidelberg University for the first time at the Second Conference on Machine Translation (WMT 2017). The goal of the task is to encourage research on learning machine translation from weak user feedback instead of human references or post-edits. On each of

Machine learning

Labeling topics with images using neural networks

Nikolaos Aletras, Arpit Mittal

ECIR 2017

2017

Topics generated by topic models are usually represented by lists of t terms or alternatively using short phrases or images. The current state-of-the-art work on labeling topics using images selects images by re-ranking a small set of candidates for a given topic. In this paper, we present a more generic method that can estimate the degree of association between any arbitrary pair of an unseen topic and

Conversational AI

Bayesian optimization with tree-structured dependencies

Rodolphe Jenatton, Cédric Archambeau, Javier González, Matthias Seeger

ICML 2017

2017

Bayesian optimization has been successfully used to optimize complex black-box functions whose evaluations are expensive. In many applications, like in deep learning and predictive analytics, the optimization domain is itself complex and structured. In this work, we focus on use cases where this domain exhibits a known dependency structure. The benefit of leveraging this structure is twofold: we explore

Operations research and optimization

Intent based relevance estimation from click logs

Prakash Mandayam Comar, Srinivasan Sengamedu, "SHS"

CIKM 2017

2017

Estimating the relevance of documents based on the user feedback is an essential component of search, retrieval and ranking problems. User click modeling in search has focused primarily on factoring out the position bias. It is easy to see that the query type (generic queries vs specific queries) and user intent (purchase vs exploration) also introduce a bias in the click signal. In other words, the results

Search and information retrieval

Cynical selection of language model training data

Amittai Axelrod

arXiv

2017

The Moore-Lewis method of “intelligent selection of language model training data” is very effective, cheap, efficient... and also has structural problems. (1) The method defines relevance by playing language models trained on the in-domain and the out-of-domain (or data pool) corpora against each other. This powerful idea – which we set out to preserve – treats the two corpora as the opposing ends of a

Conversational AI

Gini-regularized optimal transport with an application to spatio-temporal forecasting

Lucas Roberts, Leo Razoumov, Lin Su, Yuyang (Bernie) Wang

NeurIPS 2017

2017

Rapidly growing product lines and services require a finer-granularity forecast that considers geographic locales. However the open question remains, how to assess the quality of a spatio-temporal forecast?

Operations research and optimization

Probabilistic demand forecasting at scale

Joos-Hendrik Böse, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, Dustin Lange, David Salinas, Sebastian Schelter, Matthias Seeger, Yuyang (Bernie) Wang

VLDB 2017

2017

We present a platform built on large-scale, data-centric machine learning (ML) approaches, whose particular focus is demand forecasting in retail. At its core, this platform enables the training and application of probabilistic demand forecasting models, and provides convenient abstractions and support functionality for forecasting problems. The platform comprises of a complex end-to-end machine learning

Information and knowledge management

Sockeye: A toolkit for neural machine translation

Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post

arXiv

2017

We describe SOCKEYE, 1 an open-source sequence-to-sequence toolkit for Neural Machine Translation (NMT). SOCKEYE is a production-ready framework for training and applying models as well as an experimental platform for researchers. Written in Python and built on MXNET, the toolkit offers scalable training and inference for the three most prominent encoder-decoder architectures: attentional recurrent neural

Conversational AI

Conversational AI: The science behind the Alexa Prize

Ashwin Ram, Rohit Prasad, Chandra Khatri, Anushree Venkatesh, Raefer Gabriel, Qing Liu, Jeff Nunn, Behnam Hedayatnia, Ming Cheng, Ashish Nagar, Eric King, Kate Bland, Amanda Wartick, Yi Pan, Han Song, Sk Jayadevan, Gene Hwang, Art Pettigrue

Alexa Prize SocialBot Grand Challenge 1 Proceedings

2017

Conversational agents are exploding in popularity. However, much work remains in the area of social conversation as well as free-form conversation over a broad range of domains and topics. To advance the state of the art in conversational AI, Amazon launched the Alexa Prize, a 2.5-million-dollar university competition where sixteen selected university teams were challenged to build conversational agents

Conversational AI

Sockeye

Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post

2017

Sockeye is an open-source sequence-to-sequence framework for neural machine translation (NMT) built on PyTorch. It implements distributed training and optimized inference for state-of-the-art models, powering Amazon Translate and other MT applications. Recent developments and changes are tracked in our CHANGELOG. For a quickstart guide to training a standard NMT model on any size of data, see the WMT 2014

Conversational AI

Alexa smart home resources

David Dai, Mike Maas, Deepak Gokhale, David Zhang, Delin Davis, Piradeep Kandasamy, Kaifei Lei, Abhinav Miglani, Jason Xie , Henri Yandell, Mike Maas, Phil Freo

2017

You can build smart home and other products that customers can control from millions of Alexa devices with just their voice. Expand your device’s capabilities with Alexa to create delightful experiences for lights, switches, thermostats, cameras, locks, and more. Plus, your Alexa-connected devices continue to become smarter with Alexa’s growing list of smart home capabilities and features that leverage

Conversational AI

Mila

University of Montreal

Meet the Mila team from the University of Montreal, a French-language public research university in Montreal, Quebec, Canada.

SlugBot (2017)

University of California, Santa Cruz

Meet the SluBot team from UC Santa Cruz, a public land-grant research university in Santa Cruz, California.

Roving Mind

University of Trento

Picture walking into your house after a long day. As you step in, your favorite music starts playing in the background and the lights dim to suit the weather outside.

Search results

Work with us