2503.04813
Cited By
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
4 March 2025
Joykirat Singh
Tanmoy Chakraborty
A. Nambi
AI4Cl
LRM
ReLM
Papers citing "Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models"
3 / 3 papers shown
Efficient Test-Time Scaling for Small Vision-Language Models
Mehmet Onurcan Kaya
Desmond Elliott
Dim P. Papadopoulos
VLM
268
3
0
03 Oct 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRL
AI4TS
LRM
ReLM
VLM
1.8K
5,342
0
22 Jan 2025
Large Language Models Meet NLP: A Survey
Libo Qin
Qiguang Chen
Xiachong Feng
Yang Wu
Yongheng Zhang
Hai-Tao Zheng
Min Li
Wanxiang Che
Philip S. Yu
LRM
ALM
LM&MA
ELM
597
138
0
21 May 2024