[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
agent robotics vlm vision-and-language visual-grounding 3d-scene-understanding large-language-models gpt-4o vlm-grounder
-
Updated
Nov 26, 2024 - Python