Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

4,361 results found

Sort

Selfie: Reflections on TLS 1.3 with PSK

Nir Drucker, Shay Gueron

ACL 2022 Workshop on NLP for Conversational AI, Journal of Cryptography

2020

TLS 1.3 allows two parties to establish a shared session key from an out-of-band agreed Pre Shared Key (PSK). The PSK is used to mutually authenticate the parties, under the assumption that it is not shared with others. This allows the parties to skip the certificate verification steps, saving bandwidth, communication rounds, and latency. We identify a security vulnerability in this TLS 1.3 path, by showing

Conversational AI
Feature relevance quantification in explainable AI: A causal problem

Dominik Janzing, Lenon Vogel, Patrick Blöbaum

AISTATS 2020

2020

We discuss promising recent contributions on quantifying feature relevance using Shapley values, where we observed some confusion on which probability distribution is the right one for dropped features. We argue that the confusion is based on not carefully distinguishing between observational and interventional conditional probabilities and try a clarification based on Pearl’s seminal work on causality.

Machine learning
Scalable feature selection for (multitask) gradient boosted trees

Cuize Han, Nikhil Rao, Daria Sorokina, Karthik Subbian

AISTATS 2020

2020

Gradient Boosted Decision Trees (GBDTs) are widely used for building ranking and relevance models in search and recommendation. Considerations such as latency and interpretability dictate the use of as few features as possible to train these models. Feature selection in GBDT models typically involves heuristically ranking the features by importance and selecting the top few, or by performing a full backward

Related: Speeding training of decision trees

Search and information retrieval
READ: Recursive autoencoders for document layout generation

Akshay Gadi Patil, Omri Ben-Eliezer, Or Perel, Hadar Averbuch-Elor

CVPR 2020 Workshop on Text and Documents in the Deep Learning Era

2020

Layout is a fundamental component of any graphic design. Creating large varieties of plausible document layouts can be a tedious task, requiring numerous constraints to be satisfied, including local ones relating different semantic elements and global constraints on the general appearance and spacing. In this paper, we present a novel framework, coined READ, for REcursive Autoencoders for Document layout

Computer vision
Rethinking zero-shot video classification: end-to-end training for realistic applications

Biagio Brattoli, Joe Tighe, Fedor Zhdanov, Pietro Perona, Krzysztof Chalupka

CVPR 2020

2020

Trained on large datasets, deep learning (DL) can accurately classify videos into hundreds of diverse classes.However, video data is expensive to annotate. Zero-shot learning (ZSL) proposes one solution to this problem. ZSL trains a model once, and generalizes to new tasks whose classes are not present in the training dataset. We propose the first end-to-end algorithm for ZSL in video classification. Our

Related: Video classifiers learn to recognize actions they've never seen

Computer vision
Image based virtual try-on network from unpaired data

Assaf Neuberger, Eran Borenstein, Bar Hilleli, Eduard Oks, Sharon Alpert

CVPR 2020

2020

This paper presents a new image-based virtual try-on approach (Outfit-VITON) that helps visualize how a composition of clothing items selected from various reference images form a cohesive outfit on a person in a query image. Our algorithm has two distinctive properties. First, it is inexpensive, as it simply requires a large set of single (non-corresponding) images (both real and catalog) of people wearing

Related: How computer vision will help Amazon customers shop online

Computer vision
Combining detection and tracking for human pose estimation in videos

Manchen Wang, Joe Tighe, Davide Modolo

CVPR 2020

2020

We propose a novel top-down approach that tackles the problem of multi-person human pose estimation and tracking in videos. In contrast to existing to-down approaches,our method is not limited by the performance of its person detector and can predict the poses of person instances not localized. It achieves this capability by propagating known person locations forward and backward in time and searching for

Computer vision
Recursive Template-based Frame Generation for Task Oriented Dialog

Rashmi Gangadharaiah, Balakrishnan (Murali) Narayanaswamy

ACL 2020

2020

The Natural Language Understanding (NLU) component in task oriented dialog systems processes a user’s request and converts it into structured information that can be consumed by downstream components such as the Dialog State Tracker (DST). This information is typically represented as a semantic frame that captures the intent and slot-labels provided by the user. We first show that such a shallow representation

Conversational AI
SOCKEYE 2: A toolkit for neural machine translation

Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar

EAMT 2020

2020

We present SOCKEYE2, a modernized and streamlined version of the SOCKEYE neural machine translation (NMT) toolkit.New features include a simplified code base through the use of MXNet’s GluonAPI, a focus on state-of-the-art model architectures, and distributed mixed precision training. These improvements result in faster training and inference, higher automatic metric scores, and a shorter path from research

Conversational AI
ScrabbleGAN: Semi-supervised varying length handwritten text generation

Sharon Fogel, Hadar Averbuch-Elor , Sarel Cohen, Shai Mazor, Roee Litman

CVPR 2020, WiDS TLV 2020

2020

Optical character recognition (OCR) systems performance have improved significantly in the deep learning era. This is especially true for handwritten text recognition (HTR), where each author has a unique style, unlike printed text, where the variation is smaller by design. That said, deep learning based HTR is limited, as in every other task, by the number of training examples. Gathering data is a challenging

Computer vision
TXtract: Taxonomy-aware knowledge extraction for thousands of product categories

Giannis Karamanolakis, Jun Ma, Xin Luna Dong

ACL 2020

2020

Extracting structured knowledge from product profiles is crucial for various applications in e-Commerce. State-of-the-art approaches for knowledge extraction were each designed for a single category of product, and thus do not apply to real-life e-Commerce scenarios, which often contain thousands of diverse categories.This paper proposes TXtract, a taxonomy-aware knowledge extraction model that applies

Information and knowledge management
SeqVAT: Virtual adversarial training for semi-supervised sequence labeling

Luoxin Chen, Weitong Ruan, Xinyue Liu, Jianhua Lu

ACL 2020

2020

Virtual adversarial training (VAT) is a powerful technique to improve model robustness in both supervised and semi-supervised settings. It is effective and can be easily adopted on lots of image classification and text classification tasks. However, its benefits to sequence labeling tasks such as named entity recognition (NER) have not been shown as significant, mostly, because the previous approach can

Related: Using unlabeled data to improve sequence labeling

Conversational AI
DeepRacer: Autonomous racing platform for experimentation with Sim2Real reinforcement learning

Bharathan Balaji, Sunil Mallya, Sahika Genc, Saurabh Gupta, Leo Dirac, Vineet Khare, Gourav Roy, Tao Sun, Yunzhe Tao, Brian Townsend, Eddie Calleja, Sunil Muralidhara, Dhanasekar Karuppasamy

ICRA 2020

2020

DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in the physical world and demonstrates: 1) formulation and

Robotics
Fast polynomial inversion for post quantum QC-MDPC cryptography

Nir Drucker, Shay Gueron, Dusan Kostic

CSCML 2020

2020

The NIST PQC standardization project evaluates multiple new designs for post-quantum Key Encapsulation Mechanisms (KEMs). Some of them present challenging tradeoffs between communication band-width and computational overheads. An interesting case is the set of QC-MDPC based KEMs. Here, schemes that use the Niederreiter framework require only half the communication bandwidth compared to schemes that use

Security, privacy, and abuse prevention
Learning robust models for e-commerce product search

Thanh Nguyen, Nikhil Rao, Karthik Subbian

ACL 2020

2020

Showing items that do not match search query intent degrades customer experience in e-commerce. These mismatches result from counterfactual biases of the ranking algorithms toward noisy behavioral signals such as clicks and purchases in the search logs. Mitigating the problem requires a large labeled dataset, which is expensive and time-consuming to obtain. In this paper, we develop a deep, end-to-end model

Related: Adversarial training improves product discovery

Search and information retrieval

...

254

255

256

...

291

Publications

Latest news

Work with us