arXiv:2410.23214
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

30 October 2024
Sheryl Hsu
Omar Khattab
Chelsea Finn
Archit Sharma
Communities: KELM, RALM
Abstract

The hallucinations of large language models (LLMs) are increasingly mitigated by allowing LLMs to search for information and to ground their answers in real sources. Unfortunately, LLMs often struggle with posing the right search queries, especially when dealing with complex or otherwise indirect topics. Observing that LLMs can learn to search for relevant facts by trying different queries and learning to up-weight queries that successfully produce relevant results, we introduce Learning to Retrieve by Trying (LeReT), a reinforcement learning framework that explores search queries and uses preference-based optimization to improve their quality. LeReT can improve the absolute retrieval accuracy by up to 29% and the downstream generator evaluations by 17%. The simplicity and flexibility of LeReT allow it to be applied to arbitrary off-the-shelf retrievers and make it a promising technique for improving general LLM pipelines. Project website: http://sherylhsu.com/LeReT/.
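The abstract describes a simple loop: sample several candidate search queries, observe which ones retrieve relevant results, and use preference-based optimization to up-weight the good ones. The sketch below illustrates what such a preference-data collection step could look like in Python. It is a minimal, hypothetical reading of the abstract, not the paper's actual implementation: every function name, the random reward stand-in, and the best-vs-worst pairing heuristic are assumptions for illustration.

```python
# Illustrative sketch of a LeReT-style data-collection loop: sample several
# candidate search queries per question, score each by how relevant its
# retrieved results are, and keep (better, worse) query pairs for a
# preference optimizer such as DPO. All names and the reward stand-in are
# assumptions, not the paper's code.

import random
from dataclasses import dataclass
from typing import Optional

@dataclass
class PreferencePair:
    prompt: str    # the question the model is trying to answer
    chosen: str    # query whose retrieved results scored higher
    rejected: str  # query whose retrieved results scored lower

def sample_queries(question: str, n: int = 4) -> list[str]:
    # Stand-in for sampling n diverse queries from the LLM, e.g. with a
    # high decoding temperature or varied few-shot prompts.
    return [f"{question} (paraphrase {i})" for i in range(n)]

def retrieval_reward(query: str, gold_passages: set[str]) -> float:
    # Stand-in for the scoring step: issue `query` to any off-the-shelf
    # retriever and compute recall of the gold passages among its results.
    # A random score keeps this sketch runnable end to end.
    return random.random()

def collect_preference(question: str, gold_passages: set[str]) -> Optional[PreferencePair]:
    # Try several queries, then pair the best- and worst-scoring ones.
    scored = sorted((retrieval_reward(q, gold_passages), q)
                    for q in sample_queries(question))
    (worst_score, worst), (best_score, best) = scored[0], scored[-1]
    if best_score == worst_score:
        return None  # ties carry no learning signal
    return PreferencePair(prompt=question, chosen=best, rejected=worst)

# Pairs like this would then be fed to a preference optimizer so the LLM
# up-weights query styles that reliably retrieve relevant results.
print(collect_preference("Who founded the LeReT project?", {"p1", "p2"}))
```

Pairing only the best- and worst-scoring queries is one of several plausible heuristics; the key idea from the abstract is simply that retrieval quality supplies the preference signal.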
