ResearchTrend.AI
Ovis2.5 Technical Report

15 August 2025
Shiyin Lu
Yan Zhao
Yu Xia
Yuwei Hu
Shanshan Zhao
Yanqing Ma
Zhichao Wei
Yinglun Li
Lunhao Duan
Jianshan Zhao
Yuxuan Han
Haijun Li
Wanying Chen
J. Tang
Chengkun Hou
Zhixing Du
Tianli Zhou
Wenjie Zhang
Huping Ding
Jiahe Li
Wen Li
Gui Hu
Yiliang Gu
Siran Yang
Jiamang Wang
Hailong Sun
Yibo Wang
Hui Sun
Jinlong Huang
Yuping He
Shengze Shi
Weihong Zhang
Guodong Zheng
Junpeng Jiang
Sensen Gao
Yi Wu
Sijia Chen
Yuhui Chen
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
    VLM, LRM
ArXiv (abs) · PDF · HTML · HuggingFace (97 upvotes) · GitHub (1330★)

Papers citing "Ovis2.5 Technical Report"

13 / 13 papers shown
Jina-VLM: Small Multilingual Vision Language Model
Andreas Koukounas
Georgios Mastrapas
Florian Hönicke
Sedigheh Eslami
Guillaume Roncari
Scott Martens
Han Xiao
MLLM
356
0
0
03 Dec 2025
Ovis-Image Technical Report
Guo-Hua Wang
Liangfu Cao
Tianyu Cui
Minghao Fu
Xiaohao Chen
...
Jianshan Zhao
Lan Li
Bowen Fu
Jiaqi Liu
Qing-Guo Chen
VLM
533
0
0
28 Nov 2025
You Only Forward Once: An Efficient Compositional Judging Paradigm
Tianlong Zhang
Hongwei Xue
Shilin Yan
Di Wu
Chen Xu
Y. Yang
134
0
0
20 Nov 2025
PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies
Lukas Selch
Yufang Hou
Muhammad Jehanzeb Mirza
Sivan Doveh
James Glass
Rogerio Feris
Wei Lin
227
0
0
18 Oct 2025
Scope: Selective Cross-modal Orchestration of Visual Perception Experts
Tianyu Zhang
Suyuchen Wang
Chao Wang
Juan A. Rodriguez
Ahmed Masry
Xiangru Jian
Yoshua Bengio
Perouz Taslakian
MoE
277
0
0
14 Oct 2025
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
Chunyu Xie
Bin Wang
Fanjing Kong
Jincheng Li
Dawei Liang
Ji Ao
Dawei Leng
Yuhui Yin
VLM
247
3
0
13 Oct 2025
PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs
Zixin Zhang
Kanghao Chen
Xingwang Lin
Lutao Jiang
Xu Zheng
Yuanhuiyi Lyu
Litao Guo
Yinchuan Li
Ying-Cong Chen
94
3
0
10 Oct 2025
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLM, OffRL, VLM, LRM
745
8
0
06 Oct 2025
Efficient Test-Time Scaling for Small Vision-Language Models
Mehmet Onurcan Kaya
Desmond Elliott
Dim P. Papadopoulos
VLM
188
2
0
03 Oct 2025
From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models
Chenyue Zhou
Mingxuan Wang
Yanbiao Ma
Chenxu Wu
Wanyi Chen
...
Guoli Jia
Lingling Li
Z. Lu
Y. Lu
Wenhan Luo
LRM
453
9
0
29 Sep 2025
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
Xinlei Yu
C. Xu
Guibin Zhang
Yongbo He
Zhangquan Chen
...
Jiangning Zhang
Yue Liao
Xiaobin Hu
Yu-Gang Jiang
Shuicheng Yan
243
3
0
26 Sep 2025
SAIL-VL2 Technical Report
Weijie Yin
Yongjie Ye
Fangxun Shu
Yue Liao
Zijian Kang
...
Han Wang
Wenzhuo Liu
Xiao Liang
Shuicheng Yan
Chao Feng
LRM, VLM
296
4
0
17 Sep 2025
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
Tianyu Yu
Zefan Wang
Chongyi Wang
Fuwei Huang
Wenshuo Ma
...
Ning Ding
Xu Han
Xingtai Lv
Zhiyuan Liu
Maosong Sun
MLLM, VLM
197
24
0
16 Sep 2025