Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.12349
Cited By
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
23 February 2023
Kush S. Bhatia
Wenshuo Guo
Jacob Steinhardt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws"
1 / 1 papers shown
Title
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
237
4,469
0
23 Jan 2020
1