Title |
---|
![]() An Embodied Generalist Agent in 3D World Jiangyong Huang Silong Yong Xiaojian Ma Xiongkun Linghu Puhao Li Yan Wang Qing Li Song-Chun Zhu Baoxiong Jia Siyuan Huang |
![]() A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical
Image Analysis Yingshu Li Yunyi Liu Zhanyu Wang Xinyu Liang Lei Wang Lingqiao Liu Leyang Cui Zhaopeng Tu Longyue Wang Luping Zhou |
![]() Rephrase, Augment, Reason: Visual Grounding of Questions for
Vision-Language Models Archiki Prasad Elias Stengel-Eskin Mohit Bansal |