Customer-obsessed science
-
April 16, 2024
First model to work across a wide range of products uses a second U-Net encoder to capture fine-grained product details.
-
April 11, 2024
This year’s papers address topics such as speech enhancement, spoken-language understanding, dialogue, paralinguistics, and pitch estimation.
-
April 11, 2024
An animation that projects traffic fluctuations onto the U.S. map offers an example of how the Supply Chain Optimization Technologies team uses data visualization to glean insights.
-
May 7 - 11, 2024
-
May 13 - 17, 2024
-
May 20 - 25, 2024
-
April 26, 2024
Awardees, who represent 51 universities in 15 countries, have access to Amazon public datasets, along with AWS AI/ML services and tools.
-
CVPR 2024 Workshop on Multimodal Learning and Applications
2024
In e-commerce applications, vision-language multimodal transformer models play a pivotal role in product search. The key to successfully training a multimodal model lies in the alignment quality of image-text pairs in the dataset. However, the data in practice is often automatically collected with minimal manual intervention. Hence the alignment of image-text pairs is far from ideal. In e-commerce, this
-
CVPR 2024 Workshop on "What is Next in Multimodal Foundation Models?"
2024
This paper presents novel benchmarks for evaluating vision-language models (VLMs) in zero-shot recognition, focusing on granularity and specificity. Although VLMs excel in tasks like image captioning, they face challenges in open-world settings. Our benchmarks test VLMs’ consistency in understanding concepts across semantic granularity levels and their response to varying text specificity. Findings show
-
CVPR 2024 Workshop on Computer Vision for Fashion, Art, and Design
2024
Virtual try-on and product personalization have become increasingly important in modern online shopping, highlighting the need for accurate body measurement estimation. Although previous research has advanced in estimating 3D body shapes from RGB images, the task is inherently ambiguous as the observed scale of human subjects in the images depends on two unknown factors: capture distance and body dimensions
News and features
-
April 09, 2024
How the team behind Echo Frames delivered longer battery life and improved sound quality inside the slim form factor of a pair of eyeglasses.
-
March 27, 2024
The submission period opens March 27 and closes on May 7.
-
March 21, 2024
The principal economist and his team address unique challenges using techniques at the intersection of microeconomics, statistics, and machine learning.