ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.04813
  4. Cited By
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models

Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models

4 March 2025
Joykirat Singh
Tanmoy Chakraborty
A. Nambi
    AI4ClLRMReLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github

Papers citing "Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models"

3 / 3 papers shown
Efficient Test-Time Scaling for Small Vision-Language Models
Efficient Test-Time Scaling for Small Vision-Language Models
Mehmet Onurcan Kaya
Desmond Elliott
Dim P. Papadopoulos
VLM
268
3
0
03 Oct 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRLAI4TSLRMReLMVLM
1.8K
5,342
0
22 Jan 2025
Large Language Models Meet NLP: A Survey
Large Language Models Meet NLP: A Survey
Libo Qin
Qiguang Chen
Xiachong Feng
Yang Wu
Yongheng Zhang
Hai-Tao Zheng
Min Li
Wanxiang Che
Philip S. Yu
LRMALMLM&MAELM
597
138
0
21 May 2024
1
Page 1 of 1