Customer-obsessed science
Research areas
-
December 8, 20258 min readNew service lets customers mix their own data with the data used to train Amazon Nova at each major stage of model development, enabling deep domain understanding while preventing "catastrophic forgetting".
-
December 5, 20256 min read
-
-
-
November 20, 20254 min read
Featured news
-
Physical Review Research2021The disjointness of a stabilizer code is a quantity used to constrain the level of the logical Clifford hierarchy attainable by transversal gates and constant-depth quantum circuits. We show that for any positive integer constant c, the problem of calculating the c-disjointness, or even approximating it to within a constant multiplicative factor, is NP-complete. We provide bounds on the disjointness for
-
EMNLP 20212021There is an increasing interest in continuous learning (CL), as data privacy is becoming a priority for real-world machine learning applications. Meanwhile, there is still a lack of academic NLP benchmarks that are applicable for realistic CL settings, which is a major challenge for the advancement of the field. In this paper we discuss some of the unrealistic data characteristics of public datasets, study
-
NeurIPS 2021 Workshop on Distribution Shifts2021We devise a coreset selection method based on the idea of gradient matching: the gradients induced by the coreset should match, as closely as possible, those induced by the original training dataset. We evaluate the method in the context of continual learning, where it can be used to curate a rehearsal memory. Our method performs strong competitors such as reservoir sampling across a range of memory sizes
-
IEEE Xplore2021This report explains why cloud computing supports a variety of power system businesses and summarizes the latest cloud adoption use cases in the power industry. It includes the benefits and risks of moving to the cloud while suggesting risk mitigation strategies at t he same time. It also provides valuable guidelines and suggestions for power industry professionals who are considering cloud solutions yet
-
The Journal of Financial Data Science Summer2021The authors enhance pretrained language models with Securities and Exchange Commission filings data to create better language representations for features used in a predictive model. Specifically, they train RoBERTa class models with additional financial regulatory text, which they denote as a class of RoBERTa-Fin models. Using different datasets, the authors assess whether there is material improvement
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all