ViGoR: Improving visual grounding of large vision language models with fine-grained reward modeling
2024
Last updated August 8, 2024
Research areas