GEM: Graph-enhanced mixture-of-experts with ReAct agents for dialogue state tracking
2026
Dialogue State Tracking (DST) requires precise extraction of structured information from multi-domain conversations, a task where Large Language Models (LLMs) struggle despite their impressive general capabilities. We present GEM (Graph-Enhanced Mixture-of-Experts), a novel framework that combines language-model experts, graph-structured dialogue understanding, and ReAct agent-based reasoning for superior DST performance. Our approach dynamically routes between specialized experts: a Graph Neural Network that captures dialogue structure and turn-level dependencies, and a fine-tuned T5-Small encoder-decoder for sequence modeling, coordinated by an intelligent router. For complex value generation tasks, we integrate ReAct agents that perform structured reasoning over dialogue context. On MultiWOZ 2.2, GEM achieves 65.19% Joint Goal Accuracy, substantially outperforming end-to-end LLM approaches (best: 38.43%) and surpassing state-of-the-art (SOTA) methods including TOATOD (63.79%), D3ST (58.70%), and Diable (56.48%). Our graph-enhanced mixture-of-experts architecture with ReAct integration demonstrates that combining structured dialogue representation with dynamic expert routing and agent-based reasoning provides a powerful paradigm for dialogue state tracking, achieving superior accuracy while maintaining computational efficiency through selective expert activation.
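The routing-then-reasoning flow described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: all function names, the routing rule, and the slot names are hypothetical, and the real experts would be a trained GNN and a fine-tuned T5-Small rather than the stand-in functions here.

```python
# Hypothetical sketch of GEM-style dynamic expert routing with a ReAct pass.
# Everything below is illustrative: the names, the routing heuristic, and the
# slot schema are assumptions, not the paper's actual components.

def graph_expert(turn):
    # Stand-in for the GNN that models dialogue structure and
    # turn-level dependencies.
    return {"expert": "gnn", "state": {"hotel-area": turn.get("area")}}

def seq2seq_expert(turn):
    # Stand-in for the fine-tuned T5-Small encoder-decoder.
    return {"expert": "t5", "state": {"hotel-name": turn.get("name")}}

def react_agent(turn, draft):
    # Stand-in for ReAct-style reasoning over the dialogue context:
    # a "thought" inspects the draft state, an "action" repairs it.
    if any(v is None for v in draft["state"].values()):  # thought: value missing
        draft["state"] = {k: v if v is not None else "dontcare"
                          for k, v in draft["state"].items()}  # action: fill
    return draft

def track_state(turn):
    # Toy routing rule: turns with structural cues (e.g. a cross-turn
    # reference) activate the graph expert; others use the sequence model.
    # Only one expert runs per turn, which is the source of the efficiency
    # claim about selective expert activation.
    expert = graph_expert if turn.get("refers_back") else seq2seq_expert
    return react_agent(turn, expert(turn))

print(track_state({"refers_back": True, "area": "north"}))
# → {'expert': 'gnn', 'state': {'hotel-area': 'north'}}
```

The key design point the sketch mirrors is that the router selects a single expert per turn instead of running both, and the ReAct agent only post-processes the chosen expert's draft state.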