Recent publications
-
ICSE 20232023Static analysis tools detect a wide range of code defects, including code quality issues, security vulnerabilities, operational risks, and best-practice violations. Creating and maintaining a set of high-quality static analysis rules that detect misuses of popular libraries and SDKs across multiple languages is challenging. One of the mechanisms for inferring static analysis rules is by leveraging frequently
-
ICSE 20232023Static application security testing (SAST) tools have found broad adoption in modern software development workflows. These tools employ a variety of static analysis rules to generate recommendations on how to improve the code of an application. Every recommendation consumes the time of the engineer that is investigating it, so it is important to measure how useful these rules are in the long term. But what
-
ASPLOS 20232023As deep learning models nowadays are widely adopted by both cloud services and edge devices, reducing the latency of deep learning model inferences becomes crucial to provide efficient model serving. However, it is challenging to develop efficient tensor programs for deep learning operators due to the high complexity of modern accelerators (e.g., NVIDIA GPUs and Google TPUs) and the rapidly growing number
-
VLDB 20232023Existing general purpose frameworks for gigantic model training, i.e., dense models with billions of parameters, cannot scale efficiently on cloud environment with various networking conditions due to large communication overheads. In this paper, we propose MiCS, which Minimizes the Communication Scale to bring down communication overhead. Specifically, by decreasing the number of participants in a communication
-
With an ever-increasing number of smart edge devices with computation and communication constraints, Federated Learning (FL) is a promising paradigm for learning from distributed devices and their data. Typical approaches to FL aim to learn a single model that simultaneously performs well for all clients. But such an approach may be ineffective when the clients’ data distributions are heterogeneous. In
Related content
-
December 12, 2022Vice president of ML and AI Services says more than 100,000 customers are doing machine learning on AWS.
-
November 03, 2022Tim Kraska, who joined Amazon this summer to build the new Learned Systems research group, explains the power of “instance optimization”.
-
October 21, 2022Prioritizing predictability over efficiency, adapting data partitioning to traffic, and continuous verification are a few of the principles that help ensure stability, availability, and efficiency.
-
September 26, 2022Contiguous parameter management and prefetched activation offloading expand the MiCS tool kit.
-
August 17, 2022In tests, new approach is 15 to 18 times as fast as predecessors.
-
August 12, 2022Li and co-authors honored for creating an antenna design that was essential to the growth of mobile devices.
-
July 27, 2022Nafea Bshara, AWS vice president and distinguished engineer, discusses Annapurna Lab’s path to silicon success; Annapurna co-founder was a featured speaker at AWS Silicon Innovation Day virtual event.
-
June 27, 2022A new distributed-training library achieves near-linear efficiency in scaling from tens to hundreds of GPUs.
-
May 19, 2022Amazon Athena reduces query execution time by 14% by eliminating redundant operations.
-
May 18, 2022Two authors of Amazon Redshift research paper that will be presented at leading international forum for database researchers reflect on how far the first petabyte scale cloud data warehouse has advanced since it was announced ten years ago.
-
April 12, 2022Reducing the energy of ion beams used for editing eliminates the need for “sacrificial” areas between electrical components and improves precision.
-
April 04, 2022Thanks to a set of simple abstractions, models with different architectures can be integrated and optimized for particular hardware accelerators.
-
March 23, 2022Amazon researchers optimize the distributed-training tool to run efficiently on the Elastic Fabric Adapter network interface.
-
January 27, 2022The switch to WebAssembly increases stability, speed.
-
January 18, 2022Amazon DynamoDB was introduced 10 years ago today; one of its key contributors reflects on its origins, and discusses the 'never-ending journey' to make DynamoDB more secure, more available and more performant.
-
December 09, 2021Gari Clifford, the chair of the Department of Biomedical Informatics at Emory University and an Amazon Research Award recipient, wants to transform healthcare.
-
November 10, 2021Being able to understand and relate to the needs of working scientists is key to her success.
-
April 29, 2021Amazon Scholar Aravind Srinivasan on the importance of machine learning for real-time and offline resource management.
-
April 19, 2021In a pilot study, an automated code checker found about 100 possible errors, 80% of which turned out to require correction.
-
April 12, 2021In tests, a new way to allocate virtual machines across servers outperforms baselines by 10%.
-
March 29, 2021Amazon distinguished scientist and conference general chair Alex Smola on what makes MLSys unique — both thematically and culturally.