-
IEEE BigData 20232023The global e-commerce store needs to ensure compliance with various regulations at local, national, and international levels. One business use case is to identify face masks to avoid price gouging during times of high demand. In order to keep billions of items safe and legally compliant, it is important to ensure accurate classifications. Classification revisers aim to enhance classification accuracy by
-
CIKM 20232023Change Point Detection (CPD) models are used to identify abrupt changes in the distribution of a data stream and have a widespread practical use. CPD methods generally compare the distribution of data sequences before and after a given time step to infer if there is a shift in distribution at the said time step. Numerous divergence measures, which measure distance between data distributions of sequence
-
CIKM 20232023Developing text mining approaches to mine aspects from customer reviews has been well-studied due to its importance in understand-ing customer needs and product attributes. In contrast, it remains unclear how to predict the future emerging aspects of a new product that currently has little review information. This task, which we named product aspect forecasting, is critical for recommending new products
-
NeurIPS 2023 Workshop on SyntheticData4ML2023The emergence of Large Language Models (LLMs) with capabilities like In-Context Learning (ICL) has ushered in new possibilities for data generation across various domains while minimizing the need for extensive data collection and modeling techniques. Researchers have explored ways to use this generated synthetic data to optimize smaller student models for reduced deployment costs and lower latency in downstream
-
NeurIPS 2023 Workshop on Table Representation Learning2023Tabular neural network (NN) has attracted remarkable attentions and its recent advances have gradually narrowed the performance gap with respect to tree-based models on many public datasets. While the mainstreams focus on calibrating NN to fit tabular data, we emphasize the importance of homogeneous embeddings and alternately concentrate on regularizing tabular inputs through supervised pretraining. Specifically
Related content
-
July 13, 2021Innovative faculty proposals will explore various aspects of trustworthy machine learning.
-
July 7, 2021James Hensman joins an effort to expand machine learning talent for UN sustainability goals.
-
June 29, 2021How Amazon’s Delivery Experience team acts as a concierge for customers.
-
June 28, 2021Didn't get the opportunity to attend the summit earlier this month? Now available on demand: Presentations on the science of machine learning by leading scholars, a fireside chat with Andrew Ng, and more career-growth content.
-
June 22, 2021Scientists describe the use of privacy-preserving machine learning to address privacy challenges in XGBoost training and prediction.
-
June 21, 2021Özer’s paper published in INFORMS’ Management Science 2021 explores the dynamics behind “cheap-talk” communications.