Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Enhancing LLM-as-a-judge via multi-agent collaboration

Yiyue Qian, Shinan Zhang, Yun Zhou, Haibo Ding, Diego Socolinsky, Yi Zhang

AAAI 2025 Workshop on Advancing LLM-Based Multi-Agent Collaboration

2025

Large Language Models (LLMs) have revolutionized AI-generated content evaluation, with the LLM-as-a-Judge paradigm becoming increasingly popular. However, current single-LLM evaluation approaches face significant challenges, including inconsistent judgments and inherent biases from pre-training data. To address these limitations, we propose CollabEval, a novel multi-agent evaluation framework that implements

Conversational AI
Zero-shot 3D question answering via voxel-based dynamic token compression

Hsiang-Wei Huang, Fu-Chen Chen, Wenhao Chai, Che-Chun Su, Lu Xia, Sanghun Jung, Cheng-Yen Yang, Jenq-Neng Hwang, Min Sun, Cheng-Hao Kuo

CVPR 2025

2025

Recent advancements in 3D Large Multi-modal Models (3D-LMMs) have driven significant progress in 3D question answering. However, recent multi-frame VisionLanguage Models (VLMs) demonstrate superior performance compared to 3D-LMMs on 3D question answering tasks, largely due to the greater scale and diversity of available 2D image data in contrast to the more limited 3D data. Multi-frame VLMs, although achieving

Computer vision
Privacy and fairness in machine learning: A survey

Sina Shaham, Arash Hajisafi, Minh K. Quan, Dinh C. Nguyen, Bhaskar Krishnamachari, Charith Peris, Gabriel Ghinita, Cyrus Shahabi, Pubudu N. Pathirana

IEEE Transactions on Artificial Intelligence

2025

Privacy and fairness are two crucial pillars of responsible Artificial Intelligence (AI) and trustworthy Machine Learning (ML). Each objective has been independently studied in the literature with the aim of reducing utility loss in achieving them. Despite the significant interest attracted from both academia and industry, there remains an immediate demand for more in-depth research to unravel how these

Conversational AI
Mind the semantic gap: Semantic efficiency in human computer interfaces

James Horsley

Frontiers in Artificial Intelligence

2025

As we become increasingly dependent on technology in our daily lives, the usability of HCIs is a key driver of individual empowerment for us all. A primary focus of AI systems has been to make HCIs easier to use by identifying what users need and agentively taking over some of the cognitive work users would have otherwise performed, as such, they are becoming our delegates. To become effective and reliable

Conversational AI
Monte Carlo Temperature: A robust sampling strategy for LLM’s uncertainty quantification methods

Nicola Cecere, Andrea Bacciu, Ignacio Fernandez Tobias, Amin Mantrach

NAACL 2025 Workshop on TrustNLP, ICLR 2025

2025

Uncertainty quantification (UQ) in Large Language Models (LLMs) is essential for their safe and reliable deployment, particularly in critical applications where incorrect outputs can have serious consequences. Current UQ methods typically rely on querying the model multiple times using non-zero temperature sampling to generate diverse outputs for uncertainty estimation. However, the impact of selecting

Conversational AI

Automating hallucination detection with chain-of-thought reasoning

Erica Salinas, Shayan Ali Akbar

April 11, 2025

Novel three-pronged approach combines claim-level evaluations, chain-of-thought reasoning, and classification of hallucination error types.

Conversational AI
Training large language models more efficiently

Dhananjay Ram, Nikolaos Pappas

March 27, 2025

Training separate models on different datasets and then merging them reduces computational costs by as much as 91%.

Conversational AI
Amazon Nova AI Challenge accelerating the field of generative AI

Staff writer

March 10, 2025

Inaugural global university competition focused on advancing secure, trusted AI-assisted software development.

Conversational AI
Training code generation models to debug their own outputs

Varun Kumar

February 20, 2025

Using large language models to generate training data and updating models through both fine tuning and reinforcement learning improves the success rate of code generation by 39%.

Conversational AI
Lightweight LLM for converting text to structured data

Karim Bouyarmane

February 06, 2025

Novel training procedure and decoding mechanism enable model to outperform much larger foundation model prompted to perform the same task.

Conversational AI
Unlocking insights from qualitative text with LLM-enhanced topic modeling

Sreyoshi Bhaduri, Satya Kapoor

December 11, 2024

LLM-augmented clustering enables QualIT to outperform other topic-modeling methods in both topic coherence and topic diversity.

Conversational AI

Conversational AI

Publications

Related content

Work with us