ThinkTuning: Instilling Cognitive Reflections without Distillation

11 August 2025
Aswin Rrv
Jacob Dineen
Divij Handa
Md Nayem Uddin
Mihir Parmar
Chitta Baral
Ben Zhou
ReLM, LRM

Papers citing "ThinkTuning: Instilling Cognitive Reflections without Distillation"

4 / 4 papers shown
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Baolong Bi
Shenghua Liu
Yiwei Wang
Siqian Tong
Lingrui Mei
Yuyao Ge
Yilong Xu
Jiafeng Guo
Xueqi Cheng
OffRL, LRM
278
5
0
15 Nov 2025
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Xiao Ye
Jacob Dineen
Zhaonan Li
Zhikun Xu
Weiyu Chen
...
Ji-Eun Irene Yum
Muhammad Ali Khan
Muhammad Umar Afzal
Irbaz B. Riaz
Ben Zhou
LM&MA, ELM
197
1
0
20 Oct 2025
OptAgent: Optimizing Query Rewriting for E-commerce via Multi-Agent Simulation
Divij Handa
David Blincoe
Orson Adams
Yinlin Fu
176
1
0
04 Oct 2025
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Yang Zhou
Sunzhu Li
Shunyu Liu
Wenkai Fang
Jiale Zhao
...
Hengtong Lu
Wei Chen
Yan Xie
Mingli Song
Weilong Dai
LRM
265
8
0
23 Aug 2025