Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.17546
Cited By
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
24 July 2024
Hyuk Namgoong
Jeesu Jung
Sangkeun Jung
Yoonhyung Roh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Domain Robust Lightweight Reward Models based on Router Mechanism"
4 / 4 papers shown
Title
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
158
1,012
0
25 Nov 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
311
11,915
0
04 Mar 2022
Understanding Dataset Difficulty with
V
\mathcal{V}
V
-Usable Information
Kawin Ethayarajh
Yejin Choi
Swabha Swayamdipta
159
157
0
16 Oct 2021
Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Prajjwal Bhargava
Aleksandr Drozd
Anna Rogers
95
101
0
04 Oct 2021
1