Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2402.02479
Cited By
v1
v2 (latest)
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
4 February 2024
Gaurav Pandey
Yatin Nandwani
Tahira Naseem
Mayank Mishra
Guangxuan Xu
Lucian Popa
Sachindra Joshi
Asim Munawar
Ramón Fernández Astudillo
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Github
Papers citing
"BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback"
4 / 4 papers shown
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
Md Arafat Sultan
Aashka Trivedi
Yousef El-Kurdi
Tahira Naseem
Radu Florian
Salim Roukos
OffRL
318
2
0
22 May 2025
PIPA: Preference Alignment as Prior-Informed Statistical Estimation
Junbo Li
Zinan Lin
Qiang Liu
OffRL
521
1
0
09 Feb 2025
Nemotron-4 340B Technical Report
Nvidia
:
Bo Adler
Niket Agarwal
Ashwath Aithal
...
Jimmy Zhang
Jing Zhang
Vivienne Zhang
Yian Zhang
Chen Zhu
337
122
0
17 Jun 2024
HelpSteer2: Open-source dataset for training top-performing reward models
Zhilin Wang
Yi Dong
Olivier Delalleau
Jiaqi Zeng
Gerald Shen
Daniel Egert
Jimmy J. Zhang
Makesh Narsimhan Sreedhar
Oleksii Kuchaiev
AI4TS
406
197
0
12 Jun 2024
1
Page 1 of 1