ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.03387
  4. Cited By
LIMO: Less is More for Reasoning

LIMO: Less is More for Reasoning

5 February 2025
Yixin Ye
Zhen Huang
Yang Xiao
Ethan Chern
Shijie Xia
Pengfei Liu
    LRM
ArXivPDFHTML

Papers citing "LIMO: Less is More for Reasoning"

50 / 96 papers shown
Title
Table-R1: Inference-Time Scaling for Table Reasoning
Table-R1: Inference-Time Scaling for Table Reasoning
Zheyuan Yang
Lyuhao Chen
Arman Cohan
Yilun Zhao
LMTD
ReLM
LRM
46
1
0
29 May 2025
Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models
Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models
Jinzhe Li
Gengxu Li
Yi-Ju Chang
Yuan Wu
AAML
ELM
LRM
46
0
0
29 May 2025
Are Reasoning Models More Prone to Hallucination?
Are Reasoning Models More Prone to Hallucination?
Zijun Yao
Y. Liu
Yanxu Chen
Jianhui Chen
Junfeng Fang
Lei Hou
Juanzi Li
Tat-Seng Chua
ReLM
HILM
LRM
77
0
0
29 May 2025
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
Feng Luo
Yu-Neng Chuang
Guanchu Wang
Hoang Anh Duy Le
Shaochen Zhong
...
Jiayi Yuan
Yang Sui
Vladimir Braverman
Vipin Chaudhary
Xia Hu
LRM
56
1
0
28 May 2025
When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy
When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy
Jirui Qi
Shan Chen
Zidi Xiong
Raquel Fernández
Danielle S. Bitterman
Arianna Bisazza
LRM
41
0
0
28 May 2025
Spatial Knowledge Graph-Guided Multimodal Synthesis
Spatial Knowledge Graph-Guided Multimodal Synthesis
Yida Xue
Zhen Bi
Jinnan Yang
Jungang Lou
Ningyu Zhang
N. Zhang
15
0
0
28 May 2025
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Hanting Chen
Yasheng Wang
Kai Han
Dong Li
Lin Li
...
Hailin Hu
Yehui Tang
Dacheng Tao
Xinghao Chen
Yunhe Wang
LRM
46
0
0
28 May 2025
Reinforced Reasoning for Embodied Planning
Reinforced Reasoning for Embodied Planning
Di Wu
Jiaxin Fan
Junzhe Zang
G. Wang
Wei Yin
Wenhao Li
Bo Jin
LRM
52
0
0
28 May 2025
What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning
What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning
Gangwei Jiang
Yahui Liu
Zhaoyi Li
Qi Wang
Fuzheng Zhang
Linqi Song
Ying Wei
Defu Lian
LRM
27
0
0
28 May 2025
Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models
Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models
Sohyun An
Ruochen Wang
Tianyi Zhou
Cho-Jui Hsieh
KELM
LRM
53
0
0
27 May 2025
Beyond Templates: Dynamic Adaptation of Reasoning Demonstrations via Feasibility-Aware Exploration
Beyond Templates: Dynamic Adaptation of Reasoning Demonstrations via Feasibility-Aware Exploration
Yong Wu
Weihang Pan
Ke Li
Chen Binhui
Ping Li
Binbin Lin
LRM
32
0
0
27 May 2025
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective
Junnan Liu
Hongwei Liu
Linchen Xiao
Shudong Liu
Taolin Zhang
Zihan Ma
Songyang Zhang
Kai Chen
LRM
55
0
0
26 May 2025
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review
Matthew Lisondra
B. Benhabib
G. Nejat
LM&Ro
41
0
0
26 May 2025
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
Le Zhang
Bo Wang
Xipeng Qiu
Siva Reddy
Aishwarya Agrawal
LLMAG
ReLM
RALM
LRM
36
0
0
26 May 2025
Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning Distillation
Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning Distillation
Hoyun Song
Huije Lee
Jisu Shin
Sukmin Cho
Changgeon Ko
Jong C. Park
AI4MH
LRM
39
1
0
26 May 2025
Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting
Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting
Yifan Wu
Jingze Shi
Bingheng Wu
Jiayi Zhang
Xiaotian Lin
Nan Tang
Yuyu Luo
LRM
60
0
0
26 May 2025
MMATH: A Multilingual Benchmark for Mathematical Reasoning
MMATH: A Multilingual Benchmark for Mathematical Reasoning
Wenyang Luo
Wayne Xin Zhao
Jing Sha
Shijin Wang
Ji-Rong Wen
ReLM
LRM
29
0
0
25 May 2025
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Zigeng Chen
Xinyin Ma
Gongfan Fang
Ruonan Yu
Xinchao Wang
LRM
127
0
0
23 May 2025
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Fanqi Wan
Weizhou Shen
Shengyi Liao
Yingcheng Shi
Chenliang Li
Ziyi Yang
Ji Zhang
Fei Huang
Jingren Zhou
Ming Yan
OffRL
LLMAG
ReLM
LRM
57
0
0
23 May 2025
Language Matters: How Do Multilingual Input and Reasoning Paths Affect Large Reasoning Models?
Zhi Rui Tam
Cheng-Kuang Wu
Yu Ying Chiu
Chieh-Yen Lin
Yun-Nung Chen
Hung-yi Lee
LRM
55
0
0
23 May 2025
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
Amirhosein Ghasemabadi
Keith G. Mills
Baochun Li
Di Niu
LRM
46
0
0
23 May 2025
Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Yutong Chen
Jiandong Gao
Ji Wu
ALM
139
0
0
23 May 2025
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
Michael Hassid
Gabriel Synnaeve
Yossi Adi
Roy Schwartz
ReLM
LRM
73
1
0
23 May 2025
MPO: Multilingual Safety Alignment via Reward Gap Optimization
MPO: Multilingual Safety Alignment via Reward Gap Optimization
Weixiang Zhao
Yulin Hu
Yang Deng
Tongtong Wu
Wenxuan Zhang
...
An Zhang
Yanyan Zhao
Bing Qin
Tat-Seng Chua
Ting Liu
67
0
0
22 May 2025
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
Guanting Dong
Yifei Chen
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Yutao Zhu
Hangyu Mao
Guorui Zhou
Ji-Rong Wen
Ji-Rong Wen
LLMAG
SyDa
LRM
60
0
0
22 May 2025
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Cehao Yang
Xueyuan Lin
Chengjin Xu
Xuhui Jiang
Xiaojun Wu
Honghao Liu
Hui Xiong
Jian Guo
LRM
56
0
0
22 May 2025
Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN
Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN
Yao Xu
Mingyu Xu
Fangyu Lei
Wangtao Sun
Xiangrong Zeng
Bingning Wang
Guang Liu
Shizhu He
Jun Zhao
Kang Liu
LRM
49
1
0
22 May 2025
Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Wenhui Tan
Jiaze Li
Jianzhong Ju
Zhenbo Luo
Jian Luan
Ruihua Song
ReLM
OffRL
LRM
65
0
0
22 May 2025
Let LLMs Break Free from Overthinking via Self-Braking Tuning
Let LLMs Break Free from Overthinking via Self-Braking Tuning
Haoran Zhao
Yuchen Yan
Yongliang Shen
Haolei Xu
Wenqi Zhang
Kaitao Song
Jian Shao
Weiming Lu
Jun Xiao
Yueting Zhuang
LRM
61
0
0
20 May 2025
Shadow-FT: Tuning Instruct via Base
Shadow-FT: Tuning Instruct via Base
Taiqiang Wu
Runming Yang
Jiayi Li
Pengfei Hu
Ngai Wong
Yujiu Yang
165
0
0
19 May 2025
UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection
UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection
Yang Zhao
Kai Xiong
Xiao Ding
Li Du
YangouOuyang
...
Wentao Zhang
Bin Liu
Dong Hu
Bing Qin
Ting Liu
OffRL
65
0
0
18 May 2025
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
2
0
18 May 2025
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
Xinbin Yuan
Jian Zhang
K. Li
Zhuoxuan Cai
Lujian Yao
...
Enguang Wang
Qibin Hou
Jinwei Chen
Peng-Tao Jiang
Bo Li
68
1
0
18 May 2025
CoT-Vid: Dynamic Chain-of-Thought Routing with Self Verification for Training-Free Video Reasoning
CoT-Vid: Dynamic Chain-of-Thought Routing with Self Verification for Training-Free Video Reasoning
Hongbo Jin
Ruyang Liu
Wenhao Zhang
Guibo Luo
Ge Li
LRM
48
0
0
17 May 2025
Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation
Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation
Berkcan Kapusuzoglu
Supriyo Chakraborty
Chia-Hsuan Lee
Sambit Sahu
64
0
0
16 May 2025
Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
Peter Chen
Xiaopeng Li
Zhiyu Li
Xi Chen
Tianyi Lin
64
0
0
16 May 2025
Crosslingual Reasoning through Test-Time Scaling
Crosslingual Reasoning through Test-Time Scaling
Zheng-Xin Yong
Muhammad Farid Adilazuarda
Jonibek Mansurov
Ruochen Zhang
Niklas Muennighoff
Carsten Eickhoff
Genta Indra Winata
Julia Kreutzer
Stephen H. Bach
Alham Fikri Aji
LRM
ELM
349
8
0
08 May 2025
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Haoran Xu
Baolin Peng
Hany Awadalla
DongDong Chen
Yen-Chun Chen
...
Yelong Shen
Shuaiqiang Wang
Weijian Xu
Jianfeng Gao
Weizhu Chen
ReLM
LRM
114
4
0
30 Apr 2025
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
Ivan Moshkov
Darragh Hanley
Ivan Sorokin
Shubham Toshniwal
Christof Henkel
Benedikt Schifferer
Wei Du
Igor Gitman
ReLM
LRM
75
11
0
23 Apr 2025
Compass-V2 Technical Report
Compass-V2 Technical Report
Sophia Maria
MoE
LRM
67
0
0
22 Apr 2025
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models
Jie Zhu
Qian Chen
Huaixia Dou
Junhui Li
Lifan Guo
Feng-Xiang Chen
Chuxu Zhang
LRM
88
1
0
22 Apr 2025
Dynamic Early Exit in Reasoning Models
Dynamic Early Exit in Reasoning Models
Chenxu Yang
Qingyi Si
Yongjie Duan
Zheliang Zhu
Chenyu Zhu
Zheng Lin
Zheng Lin
Li Cao
Weiping Wang
ReLM
LRM
108
14
0
22 Apr 2025
TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Daocheng Fu
Zijun Chen
Renqiu Xia
Qi Liu
Yuan Feng
...
Peng Gao
Junchi Yan
Botian Shi
Bo Zhang
Yu Qiao
64
1
0
22 Apr 2025
Learning to Reason under Off-Policy Guidance
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL
LRM
68
8
0
21 Apr 2025
a1: Steep Test-time Scaling Law via Environment Augmented Generation
a1: Steep Test-time Scaling Law via Environment Augmented Generation
Lingrui Mei
Shenghua Liu
Yiwei Wang
Baolong Bi
Yuyao Ge
Jun Wan
Yurong Wu
Xueqi Cheng
LRM
69
2
0
20 Apr 2025
Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning
Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning
Jinqiao Wang
Jin Jiang
Yang Liu
Hao Fei
Xunliang Cai
LRM
61
0
0
18 Apr 2025
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Yiyou Sun
Georgia Zhou
Haoran Wang
Dexun Li
Nouha Dziri
Dawn Song
ReLM
ALM
ELM
LRM
100
5
1
16 Apr 2025
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading
Qianjin Yu
Keyu Wu
Zihan Chen
Chushu Zhang
Manlin Mei
Lingjun Huang
Fang Tan
Yongsheng Du
Kunlin Liu
Yurui Zhu
ELM
LRM
355
2
0
16 Apr 2025
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
Siyan Zhao
Devaansh Gupta
Qinqing Zheng
Aditya Grover
DiffM
LRM
AI4CE
101
4
0
16 Apr 2025
Nemotron-CrossThink: Scaling Self-Learning beyond Math Reasoning
Nemotron-CrossThink: Scaling Self-Learning beyond Math Reasoning
Syeda Nahida Akter
Shrimai Prabhumoye
Matvei Novikov
Seungju Han
Ying Lin
...
Eric Nyberg
Yejin Choi
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
ReLM
OffRL
LRM
383
2
1
15 Apr 2025
12
Next