GauchoAI: A proactive modular embodied AI that collaborates with humans
We introduce an embodied AI agent designed to interact with human users, achieve high mission success rates, and deliver a satisfying user experience. Our agent utilizes a modular framework, incorporating explicit state updates based on previous actions, next-action predictions derived from user input, target object detection from observations, and various exploration strategies for locating unseen targets. Moreover, the agent proactively suggests the next steps upon reaching specific states, reducing the need for detailed user commands. We showcase our approach on the Alexa Arena [Gao et al., 2023] platform, achieving an average rating of 4.29 from February 21st to March 21st. These results demonstrate the substantial potential of our embodied AI learning framework to enhance Human-Robot interactions and promote successful task completion across a variety of applications.