Machine learning

Developing algorithms and statistical models that computer systems use to perform tasks without explicit instructions, relying on patterns and inference instead.

Hector: An efficient programming and compilation framework for implementing relational graph neural networks in GPU architectures

Kun Wu, Mert Hidayetoğlu, Xiang Song, Sitao Zhang, Da Zheng, Israt Nisa, Wen-mei Hwu

ASPLOS 2024

2024

Relational graph neural networks (RGNNs) are graph neural networks with dedicated structures for modeling the different types of nodes and edges in heterogeneous graphs. While RGNNs have been increasingly adopted in many real-world applications due to their versatility and accuracy, they pose performance and system design challenges: inherent memory-intensive computation patterns, the gap between the programming

Machine learning
Token alignment via character matching for subword completion

Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, Yuchen Tian, Zijian Wang, Sujan Gonugondla, Sanjay Krishna Gouda, Rob Kwiatkowski, Ramesh Nallapati, Bing Xiang

ACL Findings 2024

2024

Generative models, widely utilized in various applications, can often struggle with prompts corresponding to partial tokens. This struggle stems from tokenization, where partial tokens fall out of distribution during inference, leading to incorrect or nonsensical outputs. This paper examines a technique to alleviate the tokenization artifact on text completion in generative models, maintaining performance

Machine learning
GRAM: Generative retrieval augmented matching of data schemas in the context of data security

Xuanqing Liu, Chris (Luyang) Kong, Runhui Wang, Patrick Song, Austin Nevins, Henrik Johnson, Nimish Amlath, Davor Golac

KDD 2024

2024

Schema matching constitutes a pivotal phase in the data ingestion process for contemporary database systems. Its objective is to discern pairwise similarities between two sets of attributes, each associated with a distinct data table. This challenge emerges at the initial stages of data analytics, such as when incorporating a third-party table into existing databases to inform business insights. Given its

Machine learning
An efficient self-learning framework for interactive spoken dialog systems

Hitesh Tulsiani, David M. Chan, Shalini Ghosh, Garima Lalwani, Prabhat Pandey, Ankish Bansal, Sri Garimella, Ariya Rastrow, Björn Hoffmeister

ICML 2024

2024

Dialog systems, such as voice assistants, are expected to engage with users in complex, evolving conversations. Unfortunately, traditional automatic speech recognition (ASR) systems deployed in such applications are usually trained to recognize each turn independently and lack the ability to adapt to the conversational context or incorporate user feedback. In this work, we introduce a general framework

Conversational AI
Finite-time convergence and sample complexity of actor-critic multi-objective reinforcement learning

Tianchen Zhou, Fnu Hairi, Haibo Yang, Jia (Kevin) Liu, Tian Tong, Fan Yang, Michinari Momma, Yan Gao

ICML 2024

2024

Reinforcement learning with multiple, potentially conflicting objectives is pervasive in real-world applications, while this problem remains theoretically under-explored. This paper tackles the multi-objective reinforcement learning (MORL) problem and introduces an innovative actor-critic algorithm named MOAC which finds a policy by iteratively making trade-offs among conflicting reward signals. Notably

Machine learning

Amazon Halo Rise advances the future of sleep

Chirag Bhavsar, Wesley Hong

September 28, 2022

Built-in radar technology, deep domain adaptation for sleep stage classification, and low-latency incremental sleep tracking enable Halo Rise to deliver a seamless, no-contact way to help customers improve sleep.

Machine learning
The surprisingly subtle challenge of automating damage detection

Sean O'Neill

September 19, 2022

Why detecting damage is so tricky at Amazon’s scale — and how researchers are training robots to help with that gargantuan task.

Robotics
Courtesy of Maryam Aziz

Master’s student uses SURE opportunity to explore impact of machine learning

Staff writer

September 9, 2022

The alumna of the 2021 Columbia SURE Amazon cohort becomes the first Amazon MS Fellow at Columbia.

Machine learning
Automatically optimizing execution of dynamic tensor operations

Bojian Zheng, Yida Wang

September 8, 2022

New auto-scheduler speeds optimization process sixfold while improving performance of resulting code up to 70%.

Machine learning
Pinch-grasping robot handles items with precision

John Roach

September 7, 2022

Preliminary tests show a prototype pinch-grasping robot achieved a 10-fold reduction in damage on items such as books and boxes.

Robotics
Using data science to help improve NFL quarterback passing scores

Staff writer

August 26, 2022

Principal data scientist Elena Ehrlich uses her skills to help a wide variety of customers — including the National Football League.

Machine learning

Machine learning

Recent publications

Related content

Work with us