ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.15607
  4. Cited By
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning
v1v2 (latest)

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning

21 May 2025
David Dinucu-Jianu
Jakub Macina
Nico Daheim
Ido Hakimi
Iryna Gurevych
Mrinmaya Sachan
    KELMLRM
ArXiv (abs)PDFHTMLHuggingFace (3 upvotes)Github (19★)

Papers citing "From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning"

5 / 5 papers shown
Cultivating Helpful, Personalized, and Creative AI Tutors: A Framework for Pedagogical Alignment using Reinforcement Learning
Cultivating Helpful, Personalized, and Creative AI Tutors: A Framework for Pedagogical Alignment using Reinforcement Learning
Siyu Song
Wentao Liu
Y. Lu
Ruohua Zhang
Tao Liu
...
X. Wang
Aimin Zhou
Fei Tan
Bo Jiang
Hao Hao
AI4Ed
188
0
0
27 Jul 2025
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
Zihan Wang
Kaidi Wang
Q. Wang
Pingyue Zhang
Linjie Li
...
Jiajun Wu
L. Fei-Fei
Lijuan Wang
Yejin Choi
Pengfei Yu
706
119
0
24 Apr 2025
Training LLM-based Tutors to Improve Student Learning Outcomes in Dialogues
Training LLM-based Tutors to Improve Student Learning Outcomes in DialoguesInternational Conference on Artificial Intelligence in Education (AIED), 2025
Alexander Scarlatos
Naiming Liu
Jaewook Lee
Richard Baraniuk
Andrew Lan
418
17
0
09 Mar 2025
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Alon Albalak
Duy Phung
Nathan Lile
Rafael Rafailov
Kanishk Gandhi
...
Anikait Singh
Chase Blagden
Robert Z. Sparks
Dakota Mahan
Nick Haber
OffRLLRM
262
51
0
24 Feb 2025
Towards the Pedagogical Steering of Large Language Models for Tutoring: A Case Study with Modeling Productive Failure
Towards the Pedagogical Steering of Large Language Models for Tutoring: A Case Study with Modeling Productive FailureAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Romain Puech
Jakub Macina
Julia Chatain
Mrinmaya Sachan
Manu Kapur
AI4Ed
291
11
0
03 Oct 2024
1