Scaling Autonomous Agents via Automatic Reward Modeling And PlanningInternational Conference on Learning Representations (ICLR), 2025 |
Personality Alignment of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024 |
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for
Cartoon CaptioningNeural Information Processing Systems (NeurIPS), 2024 Jifan Zhang Lalit P. Jain Yang Guo Jiayi Chen Kuan Lok Zhou ...Scott Sievert Timothy T. Rogers Kevin Jamieson Robert Mankoff Robert Nowak |