-
2024Large Language models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities, where a LLM makes predictions for a given test input together with a few input-output pairs (demonstrations). Nevertheless, the inclusion of demonstrations leads to a quadratic increase in the computational overhead of the self-attention mechanism. Existing solutions attempt to distill lengthy demonstrations
-
Information retrieval (IR) is a pivotal component in various applications. Recent advances in machine learning (ML) have enabled the integration of ML algorithms into IR, particularly in ranking systems. While there is a plethora of research on the robustness of ML-based ranking systems, these studies largely neglect commercial e-commerce systems and fail to establish a connection between real-world and
-
WSDM 20242024Review of non-taxable products is an important internal audit which is carried out by majority of e-commerce stakeholders. This process usually cross checks the initial taxability assignments to avoid any unnecessary penalties incurred to the companies during the actual audits by the respective state compliance teams/tax departments. In order to handle millions of products sold online on e-commerce websites
-
TKDD 20242024This paper presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream natural language processing (NLP) tasks. We provide discussions and insights into the usage of LLMs from the perspectives of models, data, and downstream tasks. Firstly, we offer an introduction and brief summary of current language models. Then, we discuss
-
2024State-of-the-art speech models may exhibit suboptimal performance in specific population subgroups. Detecting these challenging subgroups is crucial to enhance model robustness and fairness. Traditional methods for subgroup identification typically rely on demographic information such as age, gender, and origin. However, collecting such sensitive data at deployment time can be impractical or unfeasible
Related content
-
June 5, 2020More than eight percent of interns will have applied research, and data science roles.
-
June 4, 2020Watch the recording of Natarajan's live interview with Alexa evangelist Jeff Blankenburg.
-
May 19, 2020At smaller vocabulary sizes, tokenizers trained on unannotated data work best.
-
May 15, 2020Advances by student teams drive major improvements in users' experiences with socialbots.
-
May 12, 2020Users find speech with transferred expression 9% more natural than standard synthesized speech.
-
May 7, 2020Watch the recording of Manoj Sindhwani's live interview with Alexa evangelist Jeff Blankenburg.