ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.00722
  4. Cited By
Token-level Proximal Policy Optimization for Query Generation

Token-level Proximal Policy Optimization for Query Generation

1 November 2024
Yichen Ouyang
Lu Wang
Fangkai Yang
Pu Zhao
Chenghua Huang
Jianfeng Liu
Bochen Pang
Yaming Yang
Yuefeng Zhan
Hao Sun
Qingwei Lin
Saravan Rajmohan
Weiwei Deng
Dongmei Zhang
Feng Sun
Qi Zhang
    OffRL
ArXivPDFHTML

Papers citing "Token-level Proximal Policy Optimization for Query Generation"

1 / 1 papers shown
Title
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
31
1
0
08 Nov 2024
1