- User modeling in large e-commerce platforms aims to optimize user experiences by incorporating diverse customer activities. Traditional models targeting a single task often focus on specific business metrics, neglecting comprehensive user behavior and thus limiting their effectiveness. To develop more generalized user representations, some existing work adopts Multi-task Learning (MTL) approaches.
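The shared-representation style of multi-task learning alluded to above can be sketched as follows. This is a minimal toy, not the paper's model: the two task heads (purchase propensity and churn risk), the layer sizes, and all weights are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(x):
    return np.maximum(0.0, x)

# Shared encoder: maps raw user-activity features to one general-purpose
# user representation that every task reuses (hypothetical sizes).
W_shared = rng.normal(size=(16, 8))

# Task-specific heads: each business metric gets its own small head
# on top of the shared representation.
W_purchase = rng.normal(size=(8, 1))  # e.g. purchase propensity
W_churn = rng.normal(size=(8, 1))     # e.g. churn risk

def forward(user_features):
    """One MTL forward pass: shared encoding, then per-task outputs."""
    h = relu(user_features @ W_shared)  # shared user representation
    return {"purchase": h @ W_purchase, "churn": h @ W_churn}

batch = rng.normal(size=(4, 16))  # 4 users, 16 activity features each
outputs = forward(batch)
```

Because the encoder is trained against every task's loss at once, the shared representation is pushed to capture behavior that no single business metric would surface on its own.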
- Large Language Models (LLMs) are known to hallucinate and generate non-factual outputs, which can undermine user trust. Traditional methods that directly mitigate hallucinations, such as representation editing and contrastive decoding, often require additional training data and involve high implementation complexity. While ensemble-based approaches harness multiple LLMs to tap into the "wisdom of crowds", …
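For context on one of the mitigation methods named above: contrastive decoding rescores each candidate token by the gap between a strong ("expert") model's log-probability and a weaker ("amateur") model's, on the intuition that the amateur is relatively more prone to generic or non-factual continuations. The two toy distributions below are invented purely for illustration.

```python
import math

# Hypothetical next-token distributions from a strong and a weak LM
# for the prompt "The capital of France is ...".
expert = {"Paris": 0.60, "London": 0.25, "banana": 0.15}
amateur = {"Paris": 0.30, "London": 0.30, "banana": 0.40}

def contrastive_score(token):
    # Contrastive decoding objective:
    # log p_expert(token) - log p_amateur(token)
    return math.log(expert[token]) - math.log(amateur[token])

best = max(expert, key=contrastive_score)
# "Paris" wins: it is the only token the expert prefers *more strongly*
# than the amateur does.
```

Practical implementations also restrict the candidate set to tokens the expert itself assigns non-negligible probability (a plausibility constraint), which this sketch omits.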
- 2025: General-purpose language models (LMs) are aligned to diverse user intents, but fall short when it comes to specific applications. While finetuning is the default method for customized alignment, human annotations are often unavailable in many customization scenarios. Based on the observation that one of the main issues of LM customization is constraint adherence, we investigate the feasibility of using …
- DVCON 2025: Machine Learning (ML) accelerators are increasingly adopting diverse datatypes and data formats, such as FP16 and microscaling, to optimize key performance metrics such as inference accuracy, latency, and power consumption. However, hardware modules such as the arithmetic units and signal-processing blocks associated with these datatypes pose unique verification challenges. In this work, we present an end-to-end …
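As a concrete instance of why reduced-precision datatypes complicate verification: FP16's 10-bit mantissa means integers above 2048 are no longer exactly representable, so arithmetic results depend on the rounding mode. The sketch below uses Python's `struct` half-precision format character (`e`) only as an illustration of the datatype, not of any accelerator hardware.

```python
import struct

def to_fp16(x):
    """Round a Python float to IEEE 754 half precision and back."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# Between 2048 and 4096 the FP16 grid spacing is 2, so 2049.0 sits
# exactly halfway between two representable values; round-to-nearest-even
# snaps it down to 2048.0.
rounded = to_fp16(2049.0)

# Even 0.1 is not exactly representable, so a half-precision round trip
# changes the value.
tenth = to_fp16(0.1)
```

A verification environment must model exactly these rounding behaviors to predict bit-accurate outputs from an FP16 arithmetic unit.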
- 2025: Marked Temporal Point Processes (MTPPs) – the de facto sequence models for continuous-time event sequences, historically employed for modeling human-generated action sequences – lack awareness of external stimuli. In this study, we propose a novel framework built on the Transformer Hawkes Process (THP) to incorporate external stimuli in a domain-agnostic manner. Furthermore, we integrate personalization into …
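The Transformer Hawkes Process mentioned above parameterizes an event-intensity function with a transformer; the classical exponential-kernel Hawkes intensity it generalizes can be sketched as below. The parameter values (`mu`, `alpha`, `beta`) and event times are illustrative assumptions, not from the work itself.

```python
import math

def hawkes_intensity(t, event_times, mu=0.2, alpha=0.8, beta=1.0):
    """Classical Hawkes intensity:
        lambda(t) = mu + sum over past events t_i < t of
                    alpha * exp(-beta * (t - t_i))
    Each past event excites future arrivals, with influence decaying
    exponentially at rate beta; mu is the baseline rate."""
    return mu + sum(alpha * math.exp(-beta * (t - ti))
                    for ti in event_times if ti < t)

events = [1.0, 1.5, 2.0]
just_after_burst = hawkes_intensity(2.1, events)  # excitation still strong
long_after = hawkes_intensity(10.0, events)       # decayed near baseline mu
```

A THP replaces the fixed exponential kernel with a learned, history-dependent intensity, which is what makes it a natural base for injecting external stimuli.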
Related content
- April 1, 2021: Why conditional demographic disparity matters for developers using SageMaker Clarify.
- March 30, 2021: Learn how Bill Smart wants to simplify the ways that robots and people work together, and why waiting on a date one night changed his career path.
- March 29, 2021: Amazon distinguished scientist and conference general chair Alex Smola on what makes MLSys unique, both thematically and culturally.
- March 23, 2021: Politecnico di Milano professor Stefano Ceri is working to integrate genomic datasets into a single accessible system with the support of an Amazon Machine Learning Research Award.
- March 16, 2021: Amanda Cullen, a PhD candidate in informatics at the University of California, Irvine, wanted to do work that had an impact outside of academia; she found an ideal opportunity at Twitch.
- March 10, 2021: Exploring techniques that enable ML algorithms to learn fairer models using empirical risk minimization theory.