Classification algorithms

BinoML: Supervised ranking for automatic building labeling

ACM SIGSPATIAL 2022 1st International Workshop on Spatial Big Data and AI for Industrial Applications

2022

Building numbers shown on building outlines of a map are important information for guiding delivery associates to the correct building of a package’s recipient. Intuitively, the more labeled buildings are present in our map, the less likely to misplace an order in addition to other benefits such as delivery efficiency as drivers get better visual cues about building positions. Although there are free and

Machine learning

AutoGDA: Automated graph data augmentation for node classification

Tong Zhao, Xianfeng Tang, Danni (Danqing) Zhang, Haoming Jiang, Nikhil Rao, Yiwei Song, Pallav Agrawal, Karthik Subbian, Bing Yin, Meng Jiang

Learning on Graphs Conference

2022

Graph data augmentation has been used to improve generalizability of graph machine learning. However, by only applying fixed augmentation operations on entire graphs, existing methods overlook the unique characteristics of communities which naturally exist in the graphs. For example, different communities can have various degree distributions and homophily ratios. Ignoring such discrepancy with unified

Machine learning

DORE: Document ordered relation extraction based on generative framework

Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, Zheng Zhang

EMNLP 2022

2022

In recent years, there is a surge of generation-based information extraction work, which allows a more direct use of pre-trained language models and efficiently captures output dependencies. However, previous generative methods using lexical representation do not naturally fit document-level relation extraction (DocRE) where there are multiple entities and relational facts. In this paper, we investigate

Conversational AI

REDTab: A relation extraction dataset for knowledge extraction from web tables

Siffi Singh, Alham Fikri Aji, Gaurav Singh, Christos Christodoulopoulos

COLING 2022

2022

Relational web-tables are significant sources of structural information that are widely used for relation extraction and population of facts into knowledge graphs. To transform the webtable data into knowledge, we need to identify the relations that exist between column pairs. Currently, there are only a handful of publicly available datasets with relations annotated against natural web-tables. Most datasets

Conversational AI

Mixture of domain experts for language understanding: An analysis of modularity, task performance, and memory tradeoffs

Benjamin Kleiner, Jack G. M. FitzGerald, Haidar Khan, Gokhan Tur

SLT 2022

2022

One of the limitations of large-scale machine learning models is that they are difficult to adjust after deployment without significant re-training costs. In this paper, we focus on NLU and the needs of virtual assistant systems to continually update themselves through time to support new functionality. Specifically, we consider the tasks of intent classification (IC) and slot filling (SF), which are fundamental

Conversational AI

Open world classification with adaptive negative samples

Ke Bai, Guoyin Wang, Jiwei Li, Sunghyun Park, Sungjin Lee, Puyang Xu, Ricardo Henao, Lawrence Carin

EMNLP 2022

2022

Open world classification is a task in natural language processing with key practical relevance and impact. Since the open or unknown category data only manifests in the inference phase, finding a model with a suitable decision boundary accommodating for the identification of known classes and discrimination of the open category is challenging. The performance of existing models is limited by the lack of

Conversational AI

Assaying out-of-distribution generalization in transfer learning

Florian Wenzel, Andrea Dittadi, Peter Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

NeurIPS 2022

2022

Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions on real

Computer vision

Are two heads the same as one? Identifying disparate treatment in fair neural networks

Michael Lohaus, Matthaus Kleindessner, Krishnaram Kenthapadi, Francesco Locatello, Chris Russell

NeurIPS 2022

2022

We show that deep networks trained to satisfy demographic parity often do so through a form of race or gender awareness, and that the more we force a network to be fair, the more accurately we can recover race or gender from the internal state of the network. Based on this observation, we investigate an alternative fairness approach: we add a second classification head to the network to explicitly predict

Computer vision

Benchmarking the covariate shift robustness of open-world intent classification approaches

Sopan Khosla, Rashmi Gangadharaiah

AACL 2022

2022

Task-oriented dialog systems deployed in real-world applications are often challenged by out-of-distribution queries. These systems should not only reliably detect utterances with unsupported intents (semantic shift), but also generalize to covariate shift (supported intents from unseen distributions). However, none of the existing benchmarks for open-world intent classification focus on the second aspect

Conversational AI

Deep classification of frequently-changing activities from GPS trajectories

Emre Eftelioglu, Gil Wolff, sai krishna tejaswi nimmagadda, Vishal Kumar, Amber Roy Chowdhury

ACM SIGSPATIAL 2022

2022

Classifying trip modalities, i.e. driving, walking, etc., from GPS trajectories is one of the fundamental tasks for urban mobility analytics. It can be used for efficient route planning, human activity recognition, and public transportation design where understanding the time and location of transitioning to different modalities may provide additional insights. Informally, given a GPS trajectory consisting

Machine learning

Classification algorithms

Work with us