Bocheng Zou

Publications

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Visual Prompt Multimodal

Weifeng Lin*, Xinyu Wei*, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li

ICLR 2025, First Available on: 2024-03-29, Published on: 2025-01-22, Last Update on: 2025-02-22

[arXiv] [Project Page]

VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

Vector Graphics Multimodal Benchmark

Bocheng Zou*, Mu Cai*, Jianrui Zhang, Yong Jae Lee

EMNLP 2024, First Available on: 2024-07-15, Published on: 2024-11-16, Last Update on: 2024-08-29

[arXiv] [DOI] [Project Page]

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model

Data-Centric AI Multimodal

Yulin Luo*, Ruichuan An*, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang

ECCV 2024, First Available on: 2024-05-03, Published on: 2024-10-25, Last Update on: 2024-07-24

[arXiv] [DOI] [Project Page]

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

Personalization Multimodal

Ruichuan An*, Sihan Yang*, Renrui Zhang, Zijun Shen, Ming Lu, Gaole Dai, Hao Liang, Ziyu Guo, Shilin Yan, Yulin Luo, Bocheng Zou, Chaoqun Yang, Wentao Zhang

arXiv, First Available on: 2025-05-20, Last Update on: 2025-05-22

[arXiv]