Title |
---|
![]() AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? Han Bao Yue Huang Yanbo Wang Jiayi Ye Xiangqi Wang Xiuying Chen Mohamed Elhoseiny X. Zhang Mohamed Elhoseiny Xiangliang Zhang |
![]() Self-evolving Agents with reflective and memory-augmented abilities Xuechen Liang Yangfan He Yinghui Xia Xinyuan Song Jianhui Wang ...Keqin Li Jiaqi Chen Jinsong Yang Siyuan Chen Tianyu Shi |
![]() TableBench: A Comprehensive and Complex Benchmark for Table Question Answering Xianjie Wu Jian Yang Linzheng Chai Ge Zhang Jiaheng Liu ...Xianfu Cheng Tianzhen Sun Guanglin Niu Tongliang Li Zhoujun Li |