Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.20258
Cited By
v1
v2 (latest)
ARM: Adaptive Reasoning Model
26 May 2025
Siye Wu
Jian Xie
Yikai Zhang
Aili Chen
Kai Zhang
Yu Su
Yanghua Xiao
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (44 upvotes)
Github (24995★)
Papers citing
"ARM: Adaptive Reasoning Model"
21 / 21 papers shown
Title
Rectifying LLM Thought from Lens of Optimization
J. Liu
Hongwei Liu
Songyang Zhang
Kai Chen
LRM
104
0
0
01 Dec 2025
ChainV: Atomic Visual Hints Make Multimodal Reasoning Shorter and Better
Y. Zhang
Ming Lu
J. Pan
Tao Huang
Kuan Cheng
Qi She
Shanghang Zhang
LRM
188
0
0
21 Nov 2025
Adaptive Dual Reasoner: Large Reasoning Models Can Think Efficiently by Hybrid Reasoning
Y. Zhang
Keyu Chen
Zhifeng Shen
Ruizhi Qiao
Xing Sun
LRM
141
0
0
11 Oct 2025
ARM2: Adaptive Reasoning Model with Vision Understanding and Executable Code
Jian Xie
Zhendong Chu
Aoxiao Zhong
Kai Zhang
Mingzhe Han
Xin Fang
Jialie Shen
Qingsong Wen
LRM
257
1
0
09 Oct 2025
CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image Retrieval
Bin Kang
Bin Chen
Junjie Wang
Yulin Li
Junzhi Zhao
Zhuotao Tian
VLM
116
1
0
07 Oct 2025
Probing the Difficulty Perception Mechanism of Large Language Models
Sunbowen Lee
Qingyu Yin
Chak Tou Leong
Jialiang Zhang
Yicheng Gong
Shiwen Ni
Min Yang
Xiaoyu Shen
LRM
203
0
0
07 Oct 2025
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
Zhipeng Chen
Xiaobo Qin
Y. Wu
Yue Ling
Qinghao Ye
Wayne Xin Zhao
Guang Shi
OffRL
141
57
0
14 Aug 2025
Geometric-Mean Policy Optimization
Yuzhong Zhao
Yue Liu
Junpeng Liu
Jingye Chen
Xun Wu
...
Shaohan Huang
Lei Cui
Qixiang Ye
Fang Wan
Furu Wei
249
24
0
28 Jul 2025
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Shangke Lyu
Linjuan Wu
Yuchen Yan
Xingyu Wu
Hao Li
Yongliang Shen
Peisheng Jiang
Weiming Lu
Jun Xiao
Yueting Zhuang
OffRL
290
3
0
21 Jul 2025
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
Xiao Pu
Michael Stephen Saxon
Qingfeng Lan
William Y. Wang
LRM
271
18
0
17 Apr 2025
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLM
LRM
880
41
0
15 Apr 2025
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
Chenrui Fan
Ming Li
Lichao Sun
Tianyi Zhou
LRM
328
34
0
09 Apr 2025
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
Yan Ma
Steffi Chern
Xuyang Shen
Yiran Zhong
Pengfei Liu
OffRL
LRM
407
13
0
03 Apr 2025
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
Bairu Hou
Yang Zhang
Jiabao Ji
Yujian Liu
Kaizhi Qian
Jacob Andreas
Shiyu Chang
OffRL
LRM
306
74
0
02 Apr 2025
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
Weihao Zeng
Yuzhen Huang
Qian Liu
Wei Liu
Keqing He
Zejun Ma
Junxian He
OffRL
ReLM
LRM
582
333
0
24 Mar 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Andrew Wen
Shaochen
Zhong
Hanjie Chen
Helen Zhou
OffRL
ReLM
LRM
696
260
0
20 Mar 2025
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Sara Szymkuć
Hansi Zeng
Zhenrui Yue
Jinsung Yoon
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
OffRL
AI4TS
LRM
RALM
ReLM
KELM
784
542
0
12 Mar 2025
How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach
Ayeong Lee
Ethan Che
Tianyi Peng
LRM
380
62
0
03 Mar 2025
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
Zhenyi Shen
Hanqi Yan
Linhai Zhang
Zhanghao Hu
Yali Du
Yulan He
LRM
610
79
0
28 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRL
AI4TS
LRM
ReLM
VLM
1.2K
5,342
0
22 Jan 2025
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Kimi Team
Angang Du
Bofei Gao
Bowei Xing
Changjiu Jiang
...
Zihao Huang
Ziyao Xu
Zhiyong Yang
Zonghan Yang
Zongyu Lin
OffRL
ALM
AI4TS
VLM
LRM
930
681
0
22 Jan 2025
1