Computer vision

Helping devices see and understand our visual world.

CrossNorm and SelfNorm for generalization under distribution shifts

Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris Metaxas

ICCV 2021

2021

Traditional normalization techniques (e.g., Batch Normalization and Instance Normalization) generally and simplistically assume that training and test data follow the same distribution. As distribution shifts are inevitable in real-world applications, well-trained models with previous normalization methods can perform badly in new environments. Can we develop new normalization methods to improve generalization

Computer vision
Uniform sampling over episode difficulty

Sébastien M. R. Arnold, Guneet Singh Dhillon, Avinash Ravichandran, Stefano Soatto

NeurIPS 2021

2021

Episodic training is a core ingredient of few-shot learning to train models on tasks with limited labelled data. Despite its success, episodic training remains largely understudied, prompting us to ask the question: what is the best way to sample episodes? In this paper, we first propose a method to approximate episode sampling distributions based on their difficulty. Building on this method, we perform

Computer vision
A first look towards one-shot object detection with SPOT for data-efficient learning

Ria Chakraborty, Madhur Popli, Rachit Lamba, Rishi Verma

NeurIPS 2021 Workshop on Data-Centric AI

2021

In this work we discuss One-Shot Object Detection, a challenging task of detecting novel objects in a target scene using a single reference image called a query. To address this challenge we introduce SPOT (Surfacing POsitions using Transformers), a novel transformer based end-to-end architecture which uses synergy between the provided query and target images using a learnable Robust Feature Matching module

Computer vision
Novel ensemble diversification methods for open-set scenarios

Miriam Farber, Roman Goldenberg, George Leifman, Gal Novich

WACV 2022

2021

We revisit existing ensemble diversification approaches and present two novel diversification methods tailored for open-set scenarios. The first method uses a new loss, designed to encourage models disagreement on outliers only, thus alleviating the intrinsic accuracy-diversity trade-off. The second method achieves diversity via automated feature engineering, by training each model to disregard input features

Computer vision
Visual relationship detection using part-and-sum transformers with composite queries

Qi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto

ICCV 2021

2021

Computer vision applications such as visual relationship detection and human object interaction can be formulated as a composite (structured) set detection problem in which both the parts (subject, object, and predicate) and the sum (triplet as a whole) are to be detected in a hierarchical fashion. In this paper, we present a new approach, denoted Part-and-Sum detection Transformer (PST), to perform end-to-end

Computer vision

Courtesy Alla Sheffer

Amazon Scholar Alla Sheffer uses computer graphics to drive improvements in garment sizing and fitting

Douglas Gantenbein

February 24, 2021

Complex algorithms promise to fundamentally change a craft that still relies almost entirely on handwork.

Computer vision
Credit: Glynis Condon

Growing generative adversarial networks, layer by layer

Yuting Zhang

February 16, 2021

A new approach that grows networks dynamically promises improvements over GANs with fixed architectures or predetermined growing strategies.

Machine learning
Prime Video's work on sports field registration, recap/intro detection

Raffay Hamid

January 15, 2021

Two papers at WACV propose neural models for enhancing video-streaming experiences.

Computer vision
Credit: Photos courtesy of the speakers

Amazon at WACV: Computer vision is more than labeling pixels

Larry Hardesty

January 8, 2021

Amazon distinguished scientist Gérard Medioni on the complexities of “understanding your environment through visual input”.

Computer vision
Credit: Glynis Condon

The science behind Amazon's new StyleSnap for Home feature

Liz Sheeley

December 22, 2020

StyleSnap for fashion and home features are made possible by use of multiple convolutional neural networks.

Search and information retrieval
How a ‘Think Big’ idea helped bring Lookout for Vision to life

Staff writer

December 3, 2020

Learn about the science behind the new machine learning product for manufacturers — and how a unique approach solved a complex problem.

Machine learning

Computer vision

Recent publications

Related content

Work with us