Proposer-agent-evaluator (PAE): Autonomous skill discovery for foundation model internet agents

Yifei Zhou; Qianlan Yang; Kaixiang Lin; Min Bai; Xiong Zhou; Yu-Xiong Wang; Sergey Levine; Erran Li

Publication

Proposer-agent-evaluator (PAE): Autonomous skill discovery for foundation model internet agents

By Yifei Zhou, Qianlan Yang, Kaixiang Lin, Min Bai, Xiong Zhou, Yu-Xiong Wang, Sergey Levine, Erran Li

2025

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

A generalist foundation model agent needs to have a large and diverse skill repertoire, such as finding directions between two travel locations and buying specific items from the Internet. If each skill needs to be specified manually through a fixed set of human-annotated instructions, the agent’s skill repertoire will necessarily be limited due to the scalability of human-annotated instructions. In this work, we address this challenge by proposing Proposer-Agent-Evaluator (PAE), an effective learning system that enables foundation model agents to autonomously discover and practice skills in the wild. After a context-aware task proposer generates instructions based on website information, the agent policy attempts those tasks in the real world with resulting trajectories evaluated by an autonomous VLM-based success evaluator. The success evaluation serves as the reward signal for the agent to refine its policies through RL. We validate PAE on challenging vision-based web navigation, using both real-world and self hosted websites from WebVoyager and WebArena. Our results show that PAE significantly improves the zero-shot generalization capability of VLM Internet agents (around 50% relative improvement) to both unseen tasks and websites.

Proposer-agent-evaluator (PAE): Autonomous skill discovery for foundation model internet agents

Latest news

Work with us