Customer-obsessed science
Research areas
-
May 15, 20265 min readA new scaling law that relates particular architectural choices to loss helps identify models that improve throughput by up to 47% with no loss of accuracy.
-
May 14, 202616 min read
-
-
April 15, 20268 min read
Featured news
-
UAI 20232023Item-to-Item (I2I) recommendation is an important function that suggests replacement or complement options for an item based on their functional similarities or synergies. To capture such item relationships effectively, the recommenders need to understand why subsets of items are co-viewed or co-purchased by the customers. Graph-based models, such as graph neural networks (GNNs), provide a natural framework
-
KDD 2023 Workshop on Causal Inference and Machine Learning in Practice: Use cases for Product, Brand, Policy and Beyond2023We introduce OpportunityFinder, a code-less framework for performing a variety of causal inference studies with panel data for non-expert users. In its current state, OpportunityFinder only requires users to provide raw observational data and a configuration file. A pipeline is then triggered that inspects/processes data, chooses the suitable algorithm(s) to execute the causal study. It returns the causal
-
RSS 20232023We study the autonomous exploration task in indoor environments for the mobile ground robot. We propose a three-stage exploration strategy: viewpoint generation, viewpoint scoring, and viewpoint selection, to make the algorithm agnostic to the robot’s planning and control modules. In particular, we propose the Learning to Explore (L2E) framework, which formulates the scoring and selection stages as a learning
-
ECAI 20232023Numerous examples in the literature proved that deep learning models have the ability to work well with multimodal data. Recently, CLIP has enabled deep learning systems to learn shared latent spaces between images and text descriptions, with outstanding zero- or few-shot results in downstream tasks. In this paper we explore the same idea proposed by CLIP but applied to the speech domain, where the phonetic
-
ICCV 20232023Reading text in real-world scenarios often requires understanding the context surrounding it, especially when dealing with poor-quality text. However, current scene text recognizers are unaware of the bigger picture as they operate on cropped text images. In this study, we harness the representative capabilities of modern vision-language models, such as CLIP, to provide scene-level information to the crop-based
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all