-
Picture Coding Symposium 20242024Deep learning-based video quality assessment (deep VQA) has demonstrated significant potential in surpassing conventional metrics, with promising improvements in terms of correlation with human perception. However, the practical deployment of such deep VQA models is often limited due to their high computational complexity and large memory requirements. To address this issue, we aim to significantly reduce
-
Picture Coding Symposium 20242024Professionally generated content (PGC) streamed online can contain visual artefacts that degrade the quality of user experience. These artefacts arise from different stages of the streaming pipeline, including acquisition, post-production, compression, and transmission. To better guide streaming ex-perience enhancement, it is important to detect specific artefacts at the user end in the absence of a pristine
-
WSDM 20242024Review of non-taxable products is an important internal audit which is carried out by majority of e-commerce stakeholders. This process usually cross checks the initial taxability assignments to avoid any unnecessary penalties incurred to the companies during the actual audits by the respective state compliance teams/tax departments. In order to handle millions of products sold online on e-commerce websites
-
2024As embodied agents learn to interact, it is crucial for them to understand when, what, and to whom they should respond. While advances in natural-language processing and speech technologies have enabled conversational agents to focus on what to respond, they still struggle to determine when and to whom they should respond. In this paper, we address the addressee detection (Talking-To-Me, TTM) problem under
-
ICIPACV 20242024Anomaly detection, also referred to as one-class classification, plays a crucial role in identifying product images that deviate from the expected distribution. This study introduces Data-centric Anomaly Detection with Diffusion Models (DCADDM), presenting a systematic strategy for data collection and further diversifying the data with image generation via diffusion models. The algorithm addresses data
Related content
-
March 04, 2022Detectors for block corruption, audio artifacts, and errors in audio-video synchronization are just three of Prime Video’s quality assurance tools.
-
February 02, 2022The Amazon Scholar and Johns Hopkins University professor was honored for “pioneering contributions to subspace clustering”.
-
January 14, 2022A new metric-learning loss function groups together superclasses and learns commonalities within them.
-
January 10, 2022Method uses metric learning to determine whether images depict the same product.
-
January 06, 2022Amazon’s Joe Tighe on the major trends he sees in the field of computer vision.
-
January 04, 2022A combination of deep learning, natural language processing, and computer vision enables Amazon to hone in on the right amount of packaging for each product.