ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.02479
  4. Cited By
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural
  language generation from feedback
v1v2 (latest)

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

4 February 2024
Gaurav Pandey
Yatin Nandwani
Tahira Naseem
Mayank Mishra
Guangxuan Xu
Lucian Popa
Sachindra Joshi
Asim Munawar
Ramón Fernández Astudillo
    BDL
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)Github

Papers citing "BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback"

4 / 4 papers shown
Optimal Policy Minimum Bayesian Risk
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
Md Arafat Sultan
Aashka Trivedi
Yousef El-Kurdi
Tahira Naseem
Radu Florian
Salim Roukos
OffRL
318
2
0
22 May 2025
PIPA: Preference Alignment as Prior-Informed Statistical Estimation
PIPA: Preference Alignment as Prior-Informed Statistical Estimation
Junbo Li
Zinan Lin
Qiang Liu
OffRL
521
1
0
09 Feb 2025
Nemotron-4 340B Technical Report
Nemotron-4 340B Technical Report
Nvidia
:
Bo Adler
Niket Agarwal
Ashwath Aithal
...
Jimmy Zhang
Jing Zhang
Vivienne Zhang
Yian Zhang
Chen Zhu
337
122
0
17 Jun 2024
HelpSteer2: Open-source dataset for training top-performing reward
  models
HelpSteer2: Open-source dataset for training top-performing reward models
Zhilin Wang
Yi Dong
Olivier Delalleau
Jiaqi Zeng
Gerald Shen
Daniel Egert
Jimmy J. Zhang
Makesh Narsimhan Sreedhar
Oleksii Kuchaiev
AI4TS
406
197
0
12 Jun 2024
1
Page 1 of 1