Deep learning

Benchmarking multimodal AutoML for tabular data with text fields

Xingjian Shi, Jonas Mueller, Nick Erickson, Mu Li, Alex Smola

NeurIPS 2021 Workshop on Datasets and Benchmarks Track

2021

We consider the use of automated supervised learning systems for data tables that not only contain numeric/categorical columns, but one or more text fields as well. Here we assemble 18 multimodal data tables that each contain some text fields and stem from a real business application. Our publicly-available benchmark enables researchers to comprehensively evaluate their own methods for supervised learning

Machine learning

Self-supervised learning with data augmentations provably isolates content from style

Julius von Kugelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

NeurIPS 2021

2021

Self-supervised representation learning has shown remarkable success in a number of domains. A common practice is to perform data augmentation via hand-crafted transformations intended to leave the semantics of the data invariant. We seek to understand the empirical success of this approach from a theoretical perspective. We formulate the augmentation process as a latent variable model by postulating a

Machine learning

Dynamic inference with neural interpreters

Waleed Gondal, Nasim Rahaman, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf

NeurIPS 2021

2021

Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution. However, they are less capable of systematic generalization to data drawn from unseen but related distributions, a feat that is hypothesized to require compositional reasoning and reuse of knowledge. In this work, we present Neural Interpreters, an architecture that factorizes inference

Machine learning

Question rewriting for open-domain conversational QA: Best practices and limitations

Marco Del Tredici, Gianni Barlacchi, Xiaoyu Shen, Weiwei Cheng, Adrià de Gispert

CIKM 2021

2021

Open-domain conversational QA (ODCQA) calls for effective question rewriting (QR), as the questions in a conversation typically lack proper context for the QA model to interpret. In this paper, we compare two types of QR approaches, generative and expansive QR, in end-to-end ODCQA systems with recently released QReCC and OR-QuAC benchmarks. While it is common practice to apply the same QR approach for both

Conversational AI

FinLex: An effective use of word embeddings for financial lexicon generation

Sanjiv Das, Michele Donini, Bilal Zafar, John He, Krishnaram Kenthapadi

The Journal of Finance and Data Science (JFDS)

2021

We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons

Conversational AI

Adaptive load balancing for parallel GNN training

Qidong Su, Minjie Wang, Da Zheng, Zheng Zhang

MLSys 2021 Workshop on Neural Networks and Systems

2021

The recent emergence of demand for running Graph Neural Networks (GNNs) on giant real-world graphs requires more scalable system designs. Due to the sparse and irregular connections a graph has, parallel GNN training encounters the problem of load imbalance among workers. In this paper, we show that previous techniques based on graph partitioning are insufficient to address the load imbalance caused by GNN

Cloud and systems

DP-KB: Data programming with knowledge bases improves transformer fine tuning for answer sentence selection

Nic Jedema, Thuy Vu, Manish Gupta, Alessandro Moschitti

NeurIPS 2021 Workshop on Databases and AI (DBAI)

2021

While transformers demonstrate impressive performance on many knowledge-intensive (KI) tasks, their ability to serve as implicit knowledge bases (KBs) remains limited, as shown on several slot-filling, question-answering (QA), fact verification, and entity-linking tasks. In this paper, we implement an efficient data-programming technique that enriches training data with KB-derived context and improves

Conversational AI

Magic pyramid: Accelerating inference with early exiting and token pruning

Xuanli He, Iman Keivanloo, Yi Xu, Xiang He, Belinda Zeng, Santosh Rajagopalan, Trishul Chilimbi

NeurIPS 2021 Workshop on Efficient Natural Language and Speech Processing

2021

Pretraining and then finetuning large language models is a commonly used approach to achieving good performance in natural language processing (NLP) tasks. However, most pre-trained models have a large memory footprint and low inference speed. Deploying such large models to applications with latency constraints is challenging. In this work, we focus on accelerating the inference via conditional

Machine learning

VidTr: Video transformer without convolutions

Yanyi Zhang, Xinyu (Arthur) Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joe Tighe

ICCV 2021

2021

We introduce Video Transformer (VidTr) with separable attention for video classification. Compared with commonly used 3D networks, VidTr is able to aggregate spatiotemporal information via stacked attentions and provide better performance with higher efficiency. We first introduce the vanilla video transformer and show that the transformer module is able to perform spatio-temporal modeling from raw pixels,

Computer vision

Question answering using web lists

Anoop R Katti, Kai Hui, Adrià de Gispert, Hagen Fuerstenau

CIKM 2021

2021

There are many natural questions that are best answered with a list. We address the problem of answering such questions using lists that occur on the Web, i.e. List Question Answering (ListQA). The diverse formats of lists on the Web make this task challenging. We describe state-of-the-art methods for list extraction and ranking that also consider the text surrounding the lists as context. Due to the lack

Conversational AI

Deep learning
