ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2507.23242
40
0

Generalized Reinforcement Learning for Retriever-Specific Query Rewriter with Unstructured Real-World Documents

31 July 2025
Sungguk Cha
DongWook Kim
Taeseung Hahn
Mintae Kim
Youngsub Han
Byoung-Ki Jeon
    OffRL
ArXiv (abs)PDFHTML
Main:7 Pages
2 Figures
Bibliography:1 Pages
7 Tables
Abstract

Retrieval-Augmented Generation (RAG) systems rely heavily on effective query formulation to unlock external knowledge, yet optimizing queries for diverse, unstructured real-world documents remains a challenge. We introduce \textbf{RL-QR}, a reinforcement learning framework for retriever-specific query rewriting that eliminates the need for human-annotated datasets and extends applicability to both text-only and multi-modal databases. By synthesizing scenario-question pairs and leveraging Generalized Reward Policy Optimization (GRPO), RL-QR trains query rewriters tailored to specific retrievers, enhancing retrieval performance across varied domains. Experiments on industrial in-house data demonstrate significant improvements, with RL-QRmulti-modal\text{RL-QR}_{\text{multi-modal}}RL-QRmulti-modal​ achieving an 11\% relative gain in NDCG@3 for multi-modal RAG and RL-QRlexical\text{RL-QR}_{\text{lexical}}RL-QRlexical​ yielding a 9\% gain for lexical retrievers. However, challenges persist with semantic and hybrid retrievers, where rewriters failed to improve performance, likely due to training misalignments. Our findings highlight RL-QR's potential to revolutionize query optimization for RAG systems, offering a scalable, annotation-free solution for real-world retrieval tasks, while identifying avenues for further refinement in semantic retrieval contexts.

View on arXiv
Comments on this paper