Code and datasets

Strongly pretrained class incremental learning

Tz-Ying Wu, Gurumurthy Swaminathan, Zhizhong Li, Avinash Ravichandran, Nuno Vasconcelos, Rahul Bhotika, Stefano Soatto

2022

Class-incremental learning (CIL) has been widely studied under the setting of starting from a small number of classes (base classes). Instead, we explore an understudied real-world setting of CIL that starts with a strong model pre-trained on a large number of base classes. We hypothesize that a strong base model can provide a good representation for novel classes and incremental learning can be done with

Computer vision

Towards total recall in industrial anomaly detection

Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler

2022

Being able to spot defective parts is a critical component in large-scale industrial manufacturing. A particular challenge that we address in this work is the cold-start problem: fit a model using nominal (non-defective) example images only. While handcrafted solutions per class are possible, the goal is to build systems that work well simultaneously on many different tasks automatically. The best performing

Computer vision

Hierarchical bayesian analysis

Yu Liu

2022

This package contains the Hierarchical Bayesian model to predict sample size for online activity. The bang package (https://cran.rstudio.com/web/packages/bang/index.html) was used and accelerated by modifying it to use sufficient statistics, and to only simulate from the posterior over the hyper-parameters. "misc.R", "beta_prior.R", "binom_beta.R", "hef.R" and "set_and_check_prior.R" are source files from

Economics

Quantum computing exploration for drug discovery on AWS

Yong Liu, Aoyu Zhang, Mengxin Zhu

2022

Quantum Computing Exploration for Drug Discovery on AWS (abbrev. QCEDD), an open-sourced solution customers can launch to design and run computational studies in the area of drug discovery, e.g. molecular docking and protein folding. With this AWS Solution, you can access quantum computers through Amazon Braket. The Amazon Braket Hybrid Job feature allows you to use classical computing and quantum computing

Quantum technologies

DosCond

Wei Jin, Xianfeng Tang, Haoming Jiang, Zheng Li, Danni (Danqing) Zhang, Jiliang Tang, Bing Yin, Artur Bekasov

2022

As training deep learning models on large dataset takes a lot of time and resources, it is desired to construct a small synthetic dataset with which we can train deep learning models sufficiently. There are recent works that have explored solutions on condensing image datasets through complex bi-level optimization. For instance, dataset condensation (DC) matches network gradients w.r.t. large-real data

Machine learning

Individual preference stability for clustering

Saba Ahmadi, Pranjal Awasthi, Samir Khuller, Matthaus Kleindessner, Jamie Morgenstern, Pattara Sukprasert, Ali Vakilian

2022

In this paper, we propose a natural notion of individual preference (IP) stability for clustering, which asks that every data point, on average, is closer to the points in its own cluster than to the points in any other cluster. Our notion can be motivated from several perspectives, including game theory and algorithmic fairness. We study several questions related to our proposed notion. We first show that

Machine learning

Few-shot fine-tuning for opinion summarization

Arthur Brazinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer

2022

This repository contains the main codebase for the corresponding NAACL findings paper. In this work, we explored in-domain information storage to adapters by pre-training them on customer reviews via the leave-one-out objective. Further, we fine-tune the pre-trained adapters on a handful of summaries. This method yields state-of-the-art results in terms of ROUGE scores and reduces semantic mistakes in generated

Conversational AI

Multi-task pre-training for plug-and-play task-oriented dialogue system

Yixuan Su, Lei Shu, Elman Mansimov, Arshit Gupta, Deng Cai, Yi-An Lai, Yi Zhang

2022

Pre-trained language models have been recently shown to benefit task-oriented dialogue (TOD) systems. Despite their success, existing methods often formulate this task as a cascaded generation problem which can lead to error accumulation across different sub-tasks and greater data annotation overhead. In this study, we present PPTOD, a unified model that seamlessly supports both task-oriented dialogue understanding

Conversational AI

FactGraph: Evaluating factuality in summarization with semantic graph representations

Leonardo Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal

2022

We propose FACTGRAPH, a method that decomposes the document and the summary into structured meaning representations (MR), which are more suitable for factuality evaluation. MRs describe core semantic concepts and their relations, aggregating the main content in both document and summary in a canonical form, and reducing data sparsity. FACTGRAPH encodes such graphs using a graph encoder augmented with structure-aware

Conversational AI

DSE: Learning dialogue representations from consecutive utterances

Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew O. Arnold, Bing Xiang

2022

This repository contains the code for the paper: "Learning dialogue representations from consecutive utterances" (NAACL 2022).

Conversational AI

ReFinED

Tom Ayoola, Shubhi Tyagi, Joseph Fisher, Christos Christodoulopoulos, Andrea Pierleoni

2022

We introduce ReFinED, an efficient end-to-end entity linking model which uses fine-grained entity types and entity descriptions to perform linking. The model performs mention detection, fine-grained entity typing, and entity disambiguation for all mentions within a document in a single forward pass, making it more than 60 times faster than competitive existing approaches. ReFinED also surpasses state-of-the-art

Conversational AI

Meta-learning the difference

Zejiang Hou, Julian Salazar, George Polovets

2022

Our dynamic low-rank task-adaptive reparameterization (TARP) and model structure (TAMS) primitives are implemented as a Python library. pip install -e . The initial commit includes this README and the original codebases we build upon, listed below. Later commits isolate our contributions and demonstrate how the library is used, e.g., TARP and TAMS in a meta-learning the difference loop on top of a HuggingFace

Conversational AI

Code and datasets

More resources

Related content

Work with us