ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.02992
  4. Cited By
Guided Stream of Search: Learning to Better Search with Language Models
  via Optimal Path Guidance

Guided Stream of Search: Learning to Better Search with Language Models via Optimal Path Guidance

3 October 2024
Seungyong Moon
Bumsoo Park
Hyun Oh Song
    RALM
    AIFin
ArXivPDFHTML

Papers citing "Guided Stream of Search: Learning to Better Search with Language Models via Optimal Path Guidance"

1 / 1 papers shown
Title
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Yuxiao Qu
Matthew Y. R. Yang
Amrith Rajagopal Setlur
Lewis Tunstall
E. Beeching
Ruslan Salakhutdinov
Aviral Kumar
OffRL
64
13
0
10 Mar 2025
1