-
EuroSys 20232023Multi-task model training has been adopted to enable a single deep neural network model (often a large language model) to handle multiple tasks (e.g., question answering and text summarization). Multi-task training commonly receives input sequences of highly different lengths due to the diverse contexts of different tasks. Padding (to the same sequence length) or packing (short examples into long sequences
-
Cell phone coverage and high-speed service gaps persist in rural areas in sub-Saharan Africa, impacting public access to mobile-based financial, educational, and humanitarian services. Improving maps of telecommunications infrastructure can help inform strategies to eliminate gaps in mobile coverage. Deep neural networks, paired with remote sensing images, can be used for object detection of cell towers
-
ACM BuildSys 2023 Workshop on RLEM2023Building operations account for a significant portion of global emissions, contributing approximately 28% of global greenhouse gas emissions, according to the International Energy Agency. With the anticipated increase in cooling demand due to rising global temperatures, the optimization of rooftop units (RTUs) in buildings becomes crucial for reducing energy consumption and associated emissions. We focus
-
IEEE/ACM International Symposium on Microarchitecture (MICRO ’23)2023Achieving high performance in machine learning workloads is a crucial yet difficult task. To achieve high runtime performance on hardware platforms such as GPUs, graph-based executions such as CUDA graphs are often used to eliminate CPU runtime overheads by submitting jobs in the granularity of multiple kernels. However, many machine learning workloads, especially dynamic deep neural networks (DNNs) with
-
2023 IEEE International Conference on Cloud Networking (CloudNet)2023Nowadays, VPN technology is widely used in cloud and hybrid network communication that makes use of algorithms and tunneling to meet different security requirements. However, existing cloud VPN gateways often lack advanced monitoring capabilities and struggle to identify and resolve network connectivity and performance issues. Hence, LPMLP adapted Secure cloud VPN Gateway with Network Monitoring and Issue
Related content
-
November 10, 2023Curating the neural-architecture search space and taking advantage of human intuition reduces latency on real-world applications by up to 55%.
-
October 25, 2023Novel “checkpointing” scheme that uses CPU memory reduces the time wasted on failure recovery by more than 92%.
-
October 16, 2023Former Amazon applied science intern Margarida Ferreira conducts research to make complex cloud resources easier to manage.
-
August 21, 2023How Linghui Luo's research helps ensure code is checked and ready to deploy.
-
July 13, 2023Based on a survey of thousands of machine learning practitioners, a new CodeGuru extension addresses common problems, such as code cell execution order, incorrect API calls, and security.
-
June 28, 2023Ongoing collaboration includes Amazon joining the UW Center for the Future of Cloud Infrastructure.
-
June 20, 2023SIGMOD paper by Amazon researchers and collaborators presents flexible data definition language that enables rapid development of complex graph databases.
-
December 12, 2022Vice president of ML and AI Services says more than 100,000 customers are doing machine learning on AWS.
-
November 03, 2022Tim Kraska, who joined Amazon this summer to build the new Learned Systems research group, explains the power of “instance optimization”.
-
October 21, 2022Prioritizing predictability over efficiency, adapting data partitioning to traffic, and continuous verification are a few of the principles that help ensure stability, availability, and efficiency.
-
September 26, 2022Contiguous parameter management and prefetched activation offloading expand the MiCS tool kit.
-
August 17, 2022In tests, new approach is 15 to 18 times as fast as predecessors.
-
August 12, 2022Li and co-authors honored for creating an antenna design that was essential to the growth of mobile devices.
-
July 27, 2022Nafea Bshara, AWS vice president and distinguished engineer, discusses Annapurna Lab’s path to silicon success; Annapurna co-founder was a featured speaker at AWS Silicon Innovation Day virtual event.
-
June 27, 2022A new distributed-training library achieves near-linear efficiency in scaling from tens to hundreds of GPUs.
-
May 19, 2022Amazon Athena reduces query execution time by 14% by eliminating redundant operations.
-
May 18, 2022Two authors of Amazon Redshift research paper that will be presented at leading international forum for database researchers reflect on how far the first petabyte scale cloud data warehouse has advanced since it was announced ten years ago.
-
April 12, 2022Reducing the energy of ion beams used for editing eliminates the need for “sacrificial” areas between electrical components and improves precision.
-
April 04, 2022Thanks to a set of simple abstractions, models with different architectures can be integrated and optimized for particular hardware accelerators.
-
March 23, 2022Amazon researchers optimize the distributed-training tool to run efficiently on the Elastic Fabric Adapter network interface.
-
January 27, 2022The switch to WebAssembly increases stability, speed.