Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Visual Prompt
Multimodal
Weifeng Lin*, Xinyu Wei*, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li
ICLR 2025, First Available on: 2024-03-29, Published on: 2025-01-22, Last Update on: 2025-02-22