-
IEEE BigData 2023, 2023: Hierarchies are common structures used to organize data, such as e-commerce hierarchies associated with product data. With these product hierarchies, we aim to learn hierarchy-aware product text embeddings to improve fine-tuning performance on a variety of downstream e-commerce tasks. Existing methods leverage hierarchies by either aligning the text embeddings to separate hierarchical embeddings or by aligning
-
2023: An effective approach to design automated Question Answering (QA) systems is to efficiently retrieve answers from pre-computed databases containing question/answer pairs. One of the main challenges to this design is the lack of training/testing data. Existing resources are limited in size and topics and either do not consider answers (question-question similarity only) or their quality in the annotation
-
2023: Deep Metric Learning (DML) methods aim at learning an embedding space in which distances are closely related to the inherent semantic similarity of the inputs. Previous studies have shown that popular benchmark datasets often contain numerous wrong labels, and DML methods are susceptible to them. Intending to study the effect of realistic noise, we create an ontology of the classes in a dataset and use
-
CIKM 2023 Workshop Personalized Generative AI, 2023: Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. To personalize a language model’s output, a straightforward approach is to incorporate past user data into
-
CIKM 2023, 2023: Change Point Detection (CPD) models are used to identify abrupt changes in the distribution of a data stream and have widespread practical use. CPD methods generally compare the distribution of data sequences before and after a given time step to infer whether there is a shift in distribution at that time step. Numerous divergence measures, which measure distance between data distributions of sequence
Related content
-
October 06, 2023: Leveraging a large vision-language foundation model enables state-of-the-art performance in remote-object grounding.
-
September 26, 2023: Time series forecasting enables up-to-the-minute trend recognition, while novel two-step training process improves forecast accuracy.
-
September 14, 2023: In a keynote address, the Amazon International vice president will discuss recommendations in directed graphs, training models whose target labels change, and using prediction uncertainty to improve model performance.
-
August 04, 2023: Conference general chair and Amazon Scholar Yizhou Sun on modeling long-range dependencies, improving efficiency, and new causal models.
-
August 03, 2023: Assessing the absolute utility of query results, rather than just their relative utility, improves learning-to-rank models.
-
June 26, 2023: How phonetically blended results (PBR) help ensure customers find the content they were actually asking for.
-
June 21, 2023: The senior applied science manager envisions machine learning as the path to a better experience for Amazon customers.
-
June 06, 2023: New approach speeds graph-based search by 20% to 60%, regardless of graph construction method.
-
May 17, 2023: The Amazon senior principal scientist coauthored a 2010 paper that introduced a new way to develop algorithms that make personalized recommendations for website users.
-
April 11, 2023: The collaboration supports education, community outreach, and the application of academic research to video streaming and robotics.
-
March 21, 2023: Tailoring neighborhood sizes and sampling probability to nodes’ degree of connectivity improves the utility of graph-neural-network embeddings by as much as 230%.
-
March 14, 2023: Ren Zhang and her team tackle the interesting science challenges behind surfacing the most relevant offerings.
-
March 10, 2023: Augmenting query-product graphs with hypergraphs describing product-product relationships improves recall score by more than 48%.
-
March 07, 2023: Using reinforcement learning improves candidate selection and ranking for search, ad platforms, and recommender systems.
-
October 11, 2022: Dual embeddings of each node, as both source and target, and a novel loss function enable 30% to 160% improvements over predecessors.
-
October 05, 2022: Dataset that requires question-answering models to look up multiple facts and perform comparisons bridges a significant gap in the field.
-
September 20, 2022: Adapting natural-language-processing techniques to recommendation systems and algorithmic fairness are two central topics at this year’s conference.
-
September 02, 2022: Method would enable customers to evaluate supporting evidence for tip reliability.
-
August 25, 2022: Launched under the auspices of the KDD Cup at KDD 2022, the competition included the release of a new product query dataset.
-
July 15, 2022: New method optimizes the twin demands of retrieving relevant content and filtering out bad content.
-
July 08, 2022: New model sets new standard in accuracy while enabling 60-fold speedups.