Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.01744
Cited By
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks
2 October 2024
Mengzhao Jia
Wenhao Yu
Kaixin Ma
Tianqing Fang
Zhihan Zhang
Siru Ouyang
Hongming Zhang
Meng-Long Jiang
Dong Yu
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"
3 / 3 papers shown
Title
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
Tianqing Fang
H. M. Zhang
Z. Zhang
Kaixin Ma
W. Yu
Haitao Mi
Dong Yu
LLMAG
KELM
99
0
0
23 Apr 2025
Baichuan-Omni-1.5 Technical Report
Yadong Li
J. Liu
Tao Zhang
Tao Zhang
S. Chen
...
Jianhua Xu
Haoze Sun
Mingan Lin
Zenan Zhou
Weipeng Chen
AuLLM
64
10
0
28 Jan 2025
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems
Zifeng Zhu
Mengzhao Jia
Z. Zhang
Lang Li
Meng-Long Jiang
LRM
37
3
0
18 Oct 2024
1