-
2024: In this paper, we introduce AdaSelection, an adaptive sub-sampling method that identifies the most informative sub-samples within each minibatch to speed up the training of large-scale deep learning models without sacrificing model performance. Our method can flexibly combine an arbitrary number of baseline sub-sampling methods, incorporating both method-level importance and intra-method sample-level importance.
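The abstract above is only a teaser, but the core idea of keeping the most informative sub-samples of a minibatch can be sketched with a simple loss-based proxy. The selection criterion and the function name below are illustrative assumptions, not AdaSelection's actual scoring:

```python
def select_informative(per_sample_loss, keep_ratio=0.5):
    """Keep the highest-loss fraction of a minibatch as a rough
    'most informative' proxy (illustrative, not AdaSelection's criterion)."""
    k = max(1, int(len(per_sample_loss) * keep_ratio))
    # Rank sample indices by loss, ascending, and keep the k largest.
    ranked = sorted(range(len(per_sample_loss)), key=lambda i: per_sample_loss[i])
    return ranked[-k:]

# Backprop would then run only on the selected half of the batch.
losses = [0.10, 2.30, 0.50, 1.70, 0.05, 0.90]
selected = select_informative(losses)  # indices of the 3 largest losses
```

A real adaptive scheme would combine several such scoring rules and reweight them over training; this sketch only shows the per-minibatch selection step.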
-
2024: Pre-trained language models, trained on large-scale corpora, demonstrate strong generalizability across various NLP tasks. Fine-tuning these models for specific tasks typically involves updating all parameters, which is resource-intensive. Parameter-efficient fine-tuning (PEFT) methods, such as the popular LoRA family, introduce low-rank matrices to learn only a few parameters efficiently. However, during
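As a concrete illustration of the low-rank idea behind the LoRA family: the frozen weight W is augmented with a trainable product B·A of rank r, so only r·(d_out + d_in) parameters are learned instead of d_out·d_in. The shapes and scaling follow the common LoRA formulation; the specific dimensions here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 32, 32, 4, 8

W = rng.normal(size=(d_out, d_in))      # frozen pre-trained weight
A = 0.01 * rng.normal(size=(r, d_in))   # trainable down-projection
B = np.zeros((d_out, r))                # trainable up-projection, zero init

def lora_forward(x):
    # Effective weight is W + (alpha / r) * B @ A; only A and B receive
    # gradients, so trainable parameters drop from d_out*d_in to r*(d_out+d_in).
    return (W + (alpha / r) * (B @ A)) @ x

x = rng.normal(size=d_in)
y = lora_forward(x)  # equals W @ x until B is updated, since B starts at zero
```

Zero-initializing B makes the adapted model exactly match the pre-trained one at the start of fine-tuning, which is the standard LoRA initialization choice.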
-
2024: Code generation models are not robust to small perturbations, which often lead to incorrect generations and significantly degrade the performance of these models. Although improving the robustness of code generation models is crucial to enhancing user experience in real-world applications, existing research efforts do not address this issue. To fill this gap, we propose CodeFort, a framework to improve
-
2024: Hierarchical Text Classification (HTC) is a sub-class of multi-label classification. It is challenging because the hierarchy typically has a large number of diverse topics. Existing methods for HTC fall into two categories: local methods (a classifier for each level, node, or parent) or global methods (a single classifier for everything). Local methods are computationally expensive, whereas global methods
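To make the local-versus-global contrast concrete, here is a toy sketch using hypothetical keyword "classifiers" (the hierarchy, labels, and keywords are invented for illustration): local methods train one classifier per parent node, while a global method uses a single flat classifier over all leaf labels:

```python
# Toy two-level hierarchy: science -> {physics, biology}, sports -> {tennis, soccer}.
def make_keyword_clf(labels, keywords):
    # Returns the label whose keyword appears in the document, else the first label.
    def clf(doc):
        for label, kw in zip(labels, keywords):
            if kw in doc:
                return label
        return labels[0]
    return clf

# Local approach: one classifier per parent node (cost grows with the hierarchy).
top_clf = make_keyword_clf(["science", "sports"], ["quark", "goal"])
child_clfs = {
    "science": make_keyword_clf(["physics", "biology"], ["quark", "cell"]),
    "sports": make_keyword_clf(["tennis", "soccer"], ["serve", "goal"]),
}

def local_predict(doc):
    top = top_clf(doc)
    return [top, child_clfs[top](doc)]

# Global approach: a single classifier over every leaf label at once.
flat_clf = make_keyword_clf(
    ["physics", "biology", "tennis", "soccer"],
    ["quark", "cell", "serve", "goal"],
)
```

The local route needs one model per parent (expensive for deep, wide hierarchies), while the global route trades that cost for a single model that must discriminate among all leaves and cannot exploit per-node structure as directly.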
-
2024: Warning: this paper contains content that may be inappropriate or offensive. As generative models become available for public use in various applications, testing and analyzing vulnerabilities of these models has become a priority. In this work, we propose an automatic red teaming framework that evaluates a given black-box model and exposes its vulnerabilities against unsafe and inappropriate content generation.
Related content
-
April 1, 2021: Why conditional demographic disparity matters for developers using SageMaker Clarify.
-
March 30, 2021: Learn how Bill Smart wants to simplify the ways that robots and people work together — and why waiting on a date one night changed his career path.
-
March 29, 2021: Amazon distinguished scientist and conference general chair Alex Smola on what makes MLSys unique — both thematically and culturally.
-
March 23, 2021: Politecnico di Milano professor Stefano Ceri is working to integrate genomic datasets into a single accessible system with the support of an Amazon Machine Learning Research Award.
-
March 16, 2021: Amanda Cullen, a PhD candidate in informatics at the University of California, Irvine, wanted to do work that had an impact outside of academia — she found an ideal opportunity at Twitch.
-
March 10, 2021: Exploring and analyzing possible techniques to make ML algorithms capable of learning fairer models by utilizing empirical risk minimization theory.