-
ICLR 2024 Workshop on Data-centric Machine Learning Research, 2024. The transformer is a powerful data-modeling framework responsible for remarkable performance on a wide range of tasks. However, transformers are limited in terms of scalability, since processing long-sequence data is suboptimal and inefficient. To this end, we introduce BLRP (Bidirectional Long-Range Parser), a novel and versatile attention mechanism designed to increase performance and efficiency on
-
ICDE 2024. How can we effectively generate missing data transformations among tables in a data repository? Multiple versions of the same tables are generated during the iterative process in which data scientists and machine learning engineers fine-tune their ML pipelines, making incremental improvements. This process often involves data transformation and augmentation that produces an augmented table based on its base version
-
Krylov cubic regularized Newton: A subspace second-order method with dimension-free convergence rate. AISTATS 2024. Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computational costs. One promising approach is to execute second-order updates within a lower-dimensional subspace, giving rise to subspace second-order methods. However
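For reference (standard background on the method family named in the title, not drawn from this abstract), the cubic regularized Newton update of Nesterov and Polyak chooses each step by minimizing a cubic model of the objective; a subspace variant of the kind described restricts that minimization to a low-dimensional subspace, such as a Krylov subspace built from the gradient and Hessian:

```latex
% Cubic regularized Newton step (Nesterov & Polyak), with M a bound on the
% Lipschitz constant of the Hessian:
x_{k+1} = x_k + \operatorname*{arg\,min}_{s \in \mathbb{R}^d}
  \left[ \langle \nabla f(x_k), s \rangle
       + \tfrac{1}{2} \langle \nabla^2 f(x_k)\, s, s \rangle
       + \tfrac{M}{6} \lVert s \rVert^3 \right].
% A subspace variant restricts s to a low-dimensional subspace, e.g. the
% Krylov subspace
% \mathcal{K}_m = \mathrm{span}\{ g, Hg, \dots, H^{m-1} g \},
% where g = \nabla f(x_k) and H = \nabla^2 f(x_k),
% so the cubic subproblem is solved in m dimensions instead of d.
```

Restricting to a Krylov subspace keeps the per-iteration cost dominated by Hessian-vector products rather than full Hessian factorizations.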
-
Journal of Business Research, 2024. Sellers on online marketplaces such as Amazon.com use a variety of retail and retail media advertising services to improve their brand performance, including awareness, consideration, and revenue. But how can they measure their progress and drive these metrics? For 122,000 brands, we measure Amazon shoppers’ brand awareness, consideration, and purchases and test how they change with ad and retail actions
-
2024. Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well? More specifically, do the graph structure and the node features carry enough usable information for the task? Our goals are (1) to develop a fast tool to measure how much information is in the graph structure and in the node features, and (2) to exploit
Related content
-
March 21, 2019. Sentiment analysis is the computational attempt to determine from someone’s words how he or she feels about something. It has a host of applications in market research, media analysis, customer service, and product recommendation, among other things. Sentiment classifiers are typically machine learning systems, and any given application of sentiment analysis may suffer from a lack of annotated data for training purposes.
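To make the task concrete, here is a minimal lexicon-based sentiment scorer. It is purely illustrative (the classifiers described above are machine-learned systems, not word lists), and the word lists are invented for the sketch:

```python
# Minimal lexicon-based sentiment scorer (illustrative only; the systems
# discussed in the article are trained machine learning classifiers).
POSITIVE = {"great", "love", "excellent", "good"}
NEGATIVE = {"bad", "terrible", "hate", "poor"}

def sentiment(text: str) -> str:
    """Score a sentence by counting positive vs. negative words."""
    words = text.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(sentiment("I love this product, it is great"))  # positive
print(sentiment("terrible quality, I hate it"))       # negative
```

A real system replaces the hand-built lexicon with parameters learned from annotated examples, which is exactly why the scarcity of annotated data mentioned above matters.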
-
March 20, 2019. Although deep neural networks have enabled accurate large-vocabulary speech recognition, training them requires thousands of hours of transcribed data, which is time-consuming and expensive to collect. So Amazon scientists have been investigating techniques that will let Alexa learn with minimal human involvement, techniques that fall in the categories of unsupervised and semi-supervised learning.
-
March 11, 2019. In experiments involving sound recognition, technique reduces error rate by 15% to 30%.
-
March 5, 2019. The 2018 Alexa Prize featured eight student teams from four countries, each of which adopted distinctive approaches to some of the central technical questions in conversational AI. We survey those approaches in a paper we released late last year, and the teams themselves go into even greater detail in the papers they submitted to the latest Alexa Prize Proceedings. Here, we touch on just a few of the teams’ innovations.
-
February 27, 2019. To ensure that Alexa Prize contestants can concentrate on dialogue systems — the core technology of socialbots — Amazon scientists and engineers built a set of machine learning modules that handle fundamental conversational tasks and a development environment that lets contestants easily mix and match existing modules with those of their own design.
-
January 30, 2019. Many of today’s most popular AI systems are, at their core, classifiers. They classify inputs into different categories: this image is a picture of a dog, not a cat; this audio signal is an instance of the word “Boston”, not the word “Seattle”; this sentence is a request to play a video, not a song. But what happens if you need to add a new class to your classifier — if, say, someone releases a new type of automated household appliance that your smart-home system needs to be able to control?
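One simple setting where a new class can be added without retraining everything is a nearest-centroid classifier. The sketch below is illustrative only (the systems described above are neural classifiers, and the feature vectors and labels here are invented): adding a class just means computing one more centroid.

```python
# Minimal nearest-centroid classifier sketch: adding a new class only
# requires computing one additional centroid; existing classes are untouched.
# (Illustrative only; not the neural classifiers discussed in the article.)
from math import dist

class CentroidClassifier:
    def __init__(self):
        self.centroids = {}

    def add_class(self, label, examples):
        # Centroid = coordinate-wise mean of the class's feature vectors.
        n = len(examples)
        self.centroids[label] = tuple(sum(x) / n for x in zip(*examples))

    def predict(self, point):
        # Assign the label whose centroid is closest in Euclidean distance.
        return min(self.centroids, key=lambda c: dist(point, self.centroids[c]))

clf = CentroidClassifier()
clf.add_class("dog", [(0.0, 0.0), (1.0, 1.0)])
clf.add_class("cat", [(5.0, 5.0), (6.0, 6.0)])
print(clf.predict((0.4, 0.6)))            # dog
clf.add_class("vacuum", [(10.0, 0.0)])    # new appliance class, added cheaply
print(clf.predict((9.0, 1.0)))            # vacuum
```

For deep classifiers the analogous move — extending the label set without full retraining — is much harder, which is the problem the post goes on to discuss.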