Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2412.01175
Cited By
v1
v2 (latest)
OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?
International Conference on Learning Representations (ICLR), 2024
2 December 2024
Zhongfu Chen
Tingzhu Chen
Wenjun Zhang
Guangtao Zhai
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?"
42 / 42 papers shown
MACEval: A Multi-Agent Continual Evaluation Network for Large Models
Z. Chen
Yuze Sun
Yuan Tian
Wenjun Zhang
Guangtao Zhai
ALM
ELM
214
0
0
12 Nov 2025
OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research
Caoshuo Li
Zengmao Ding
Xiaobin Hu
Bang Li
Donghao Luo
...
Feng Gao
AndyPian Wu
SevenShu
Chaoyang Wang
Chengjie Wang
LLMAG
120
1
0
30 Oct 2025
PictOBI-20k: Unveiling Large Multimodal Models in Visual Decipherment for Pictographic Oracle Bone Characters
Z. Chen
Wenjie Hua
Jinhao Li
Lirong Deng
Fan Du
Tingzhu Chen
Guangtao Zhai
102
1
0
06 Sep 2025
Interpretable Oracle Bone Script Decipherment through Radical and Pictographic Analysis with LVLMs
Kaixin Peng
Mengyang Zhao
Haiyang Yu
Teng Fu
Bin Li
140
0
0
13 Aug 2025
PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving
Zeyu Zhang
Zhongfu Chen
Zicheng Zhang
Yuze Sun
Yuan Tian
Ziheng Jia
Chunyi Li
Xiaohong Liu
Xiongkuo Min
Guangtao Zhai
MLLM
177
5
0
15 Apr 2025
Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark
Jinhao Li
Zhongfu Chen
Runze Dong
Tingzhu Chen
Can Wang
Guangtao Zhai
DiffM
271
2
0
13 Apr 2025
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
Xiaojun Bi
Shuo Li
Liang Luo
Ziyue Wang
Fuwen Luo
Weizheng Qiao
Lu Han
Ziwei Sun
Peng Li
Yang Liu
968
4
0
05 Mar 2025
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Le Xue
Manli Shu
Anas Awadalla
Jun Wang
An Yan
...
Zeyuan Chen
Silvio Savarese
Juan Carlos Niebles
Caiming Xiong
Ran Xu
VLM
525
141
0
16 Aug 2024
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
International Conference on Learning Representations (ICLR), 2024
Jiabo Ye
Haiyang Xu
Haowei Liu
Anwen Hu
Ming Yan
Qi Qian
Ji Zhang
Fei Huang
Jingren Zhou
MLLM
VLM
313
225
0
09 Aug 2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Yuan Yao
Tianyu Yu
Ao Zhang
Chongyi Wang
Junbo Cui
...
Xu Han
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
VLM
MLLM
447
868
0
03 Aug 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Team GLM
:
Aohan Zeng
Bin Xu
Bowen Wang
...
Zhaoyu Wang
Zhen Yang
Zhengxiao Du
Zhenyu Hou
Zihan Wang
ALM
371
1,167
0
18 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Neural Information Processing Systems (NeurIPS), 2024
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
312
35
0
10 Jun 2024
A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
Zicheng Zhang
H. Wu
Chunyi Li
Yingjie Zhou
Wei Sun
Xiongkuo Min
Zijian Chen
Xiaohong Liu
Weisi Lin
Guangtao Zhai
EGVM
372
38
0
05 Jun 2024
Deciphering Oracle Bone Language with Diffusion Models
Haisu Guan
Huanxin Yang
Xinyu Wang
Shengwei Han
Yongge Liu
Lianwen Jin
Xiang Bai
Yunxing Liu
AAML
AI4CE
426
22
0
02 Jun 2024
What matters when building vision-language models?
Neural Information Processing Systems (NeurIPS), 2024
Hugo Laurençon
Léo Tronchon
Matthieu Cord
Victor Sanh
VLM
299
274
0
03 May 2024
InternLM2 Technical Report
Zheng Cai
Maosong Cao
Haojiong Chen
Kai-xiang Chen
Keyu Chen
...
Jingming Zhuo
Yi-Ling Zou
Xipeng Qiu
Yu Qiao
Dahua Lin
ALM
288
308
0
26 Mar 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Haoyu Lu
Wen Liu
Bo Zhang
Bing-Li Wang
Kai Dong
...
Yaofeng Sun
Chengqi Deng
Hanwei Xu
Zhenda Xie
Chong Ruan
VLM
437
642
0
08 Mar 2024
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Jun-Yan He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Jinpeng Lan
Bin Luo
Xuansong Xie
MLLM
VLM
224
35
0
05 Mar 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
280
231
0
27 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
355
121
0
19 Feb 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Sijin Yu
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Yuan Liu
VLM
MLLM
366
342
0
29 Jan 2024
An open dataset for oracle bone script recognition and decipherment
Scientific Data (Sci Data), 2024
Pengjie Wang
Kaile Zhang
Xinyu Wang
Shengwei Han
Yongge Liu
...
Haisu Guan
Zhebin Kuang
Lianwen Jin
Xiang Bai
Yuliang Liu
AI4CE
193
2
0
27 Jan 2024
An open dataset for the evolution of oracle bone characters: EVOBC
Haisu Guan
Jinpeng Wan
Yuliang Liu
Pengjie Wang
Kaile Zhang
Zhebin Kuang
Xinyu Wang
Xiang Bai
Lianwen Jin
279
11
0
23 Jan 2024
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
Computer Vision and Pattern Recognition (CVPR), 2024
Shengbang Tong
Zhuang Liu
Yuexiang Zhai
Yi-An Ma
Yann LeCun
Saining Xie
VLM
MLLM
412
552
0
11 Jan 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
Zhe Chen
Jiannan Wu
Wenhai Wang
Weijie Su
Guo Chen
...
Bin Li
Ping Luo
Tong Lu
Yu Qiao
Jifeng Dai
VLM
MLLM
635
2,168
0
21 Dec 2023
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Computer Vision and Pattern Recognition (CVPR), 2023
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLM
ELM
VLM
849
1,600
0
27 Nov 2023
CogVLM: Visual Expert for Pretrained Language Models
Neural Information Processing Systems (NeurIPS), 2023
Weihan Wang
Qingsong Lv
Wenmeng Yu
Wenyi Hong
Ji Qi
...
Bin Xu
Juanzi Li
Yuxiao Dong
Ming Ding
Jie Tang
VLM
MLLM
649
709
0
06 Nov 2023
Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Banghao Chen
Zhaofeng Zhang
Nicolas Langrené
Shengxin Zhu
LLMAG
429
89
0
23 Oct 2023
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision
International Conference on Learning Representations (ICLR), 2023
Haoning Wu
Zicheng Zhang
Erli Zhang
Chaofeng Chen
Liang Liao
...
Chunyi Li
Wenxiu Sun
Qiong Yan
Guangtao Zhai
Weisi Lin
VLM
364
223
0
25 Sep 2023
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
Jinze Bai
Shuai Bai
Shusheng Yang
Shijie Wang
Sinan Tan
Peng Wang
Junyang Lin
Chang Zhou
Jingren Zhou
MLLM
VLM
ObjD
513
1,565
0
24 Aug 2023
AgentBench: Evaluating LLMs as Agents
International Conference on Learning Representations (ICLR), 2023
Xiao-Yang Liu
Hao Yu
Hanchen Zhang
Yifan Xu
Xuanyu Lei
...
Yu-Chuan Su
Huan Sun
Shiyu Huang
Yuxiao Dong
Jie Tang
ELM
LLMAG
526
494
0
07 Aug 2023
Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations
ACM Multimedia (ACM MM), 2023
Xiaolei Diao
D. Shi
Jiacheng Li
Lida Shi
Mingzhe Yue
RuiHua Qi
Chuntao Li
Hao Xu
165
13
0
01 Aug 2023
MMBench: Is Your Multi-modal Model an All-around Player?
European Conference on Computer Vision (ECCV), 2023
Yuanzhan Liu
Haodong Duan
Yuanhan Zhang
Yue Liu
Songyang Zhang
...
Yuan Liu
Conghui He
Ziwei Liu
Kai-xiang Chen
Dahua Lin
674
1,646
0
12 Jul 2023
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
International Conference on Learning Representations (ICLR), 2023
Fuxiao Liu
Kevin Qinghong Lin
Linjie Li
Jianfeng Wang
Yaser Yacoob
Lijuan Wang
VLM
MLLM
427
399
0
26 Jun 2023
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Chaoyou Fu
Peixian Chen
Chunjiang Ge
Yulei Qin
Mengdan Zhang
...
Xing Sun
Zhenyu Qiu
Rongrong Ji
Caifeng Shan
Ran He
ELM
MLLM
769
1,219
0
23 Jun 2023
Visual Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2023
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
1.1K
7,377
0
17 Apr 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
4.6K
20,717
0
15 Mar 2023
Large Language Models Are Human-Level Prompt Engineers
International Conference on Learning Representations (ICLR), 2022
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM
LLMAG
467
1,167
0
03 Nov 2022
Unsupervised Structure-Texture Separation Network for Oracle Character Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Mei Wang
Weihong Deng
Chenguang Liu
206
43
0
13 May 2022
Recognition of Oracle Bone Inscriptions by using Two Deep Learning Models
International Journal of Digital Humanities (IJDH), 2021
Yoshiyuki Fujikawa
Hengyi Li
Xuebin Yue
C. Aravinda
G. AmarPrabhu
Lin Meng
235
43
0
03 May 2021
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
2.3K
7,458
0
21 Apr 2019
NIMA: Neural Image Assessment
Hossein Talebi
P. Milanfar
3DH
408
1,057
0
15 Sep 2017
1