Customer-obsessed science
Research areas
-
May 26, 20265 min readHow to train language models to generate diverse, accurate reasoning paths using tokens that control distinct reasoning strategies.
-
-
May 14, 202616 min read
-
-
April 15, 20268 min read
Featured news
-
KDD 2023 Workshop on Mining and Learning with Graphs, WSDM 20242023Graph Neural Networks (GNNs) have demonstrated promising outcomes across various tasks, including node classification and link prediction. Despite their remarkable success in various high-impact applications, we have identified three common pitfalls in message passing for link prediction, especially within industrial settings. Particularly, in prevalent GNN frameworks (e.g., DGL and PyTorchGeometric), the
-
NeurIPS 2023 Workshop on Robot Learning2023Offline meta-reinforcement learning (OMRL) aims to generalize an agent’s knowledge from training tasks with offline data to a new unknown RL task with few demonstration trajectories. This paper proposes T3GDT: Three-tier tokens to Guide Decision Transformer for OMRL. First, our approach learns a global token from its demonstrations to summarize a RL task’s transition dynamic and reward pattern. This global
-
NeurIPS 20232023We present a framework for transfer learning that efficiently adapts a large basemodel by learning lightweight cross-attention modules attached to its intermediate activations. We name our approach InCA (Introspective-Cross-Attention) and show that it can efficiently survey a network’s representations and identify strong performing adapter models for a downstream task. During training, InCA enables training
-
KDD 2024, NeurIPS 2023 Workshop on Distribution Shifts (DistShifts)2023Pre-trained language models (PLMs) have seen tremendous success in text classification (TC) problems in the context of Natural Language Processing (NLP). In many real-world text classification tasks, the class definitions being learned do not remain constant but rather change with time - this is known as concept shift. Most techniques for handling concept shift rely on retraining the old classifiers with
-
NeurIPS 20232023Earth system forecasting has traditionally relied on complex physical models that are computationally expensive and require significant domain expertise. In the past decade, the unprecedented increase in spatiotemporal Earth observation data has enabled data-driven forecasting models using deep learning techniques. These models have shown promise for diverse Earth system forecasting tasks. However, they
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all