ResearchTrend.AI
arXiv:2412.08393
Learning to Reason via Self-Iterative Process Feedback for Small Language Models

11 December 2024
Kaiyuan Chen, Jin Wang, Xuejie Zhang
    LRM
    ReLM

Papers citing "Learning to Reason via Self-Iterative Process Feedback for Small Language Models"

1 / 1 papers shown
Training Small Reasoning LLMs with Cognitive Preference Alignment
Wenrui Cai, Chengyu Wang, Junbing Yan, Jun Huang, Xiangzhong Fang
LRM
14 Apr 2025