ResearchTrend.AI
Prototypical Reward Network for Data-Efficient RLHF

6 June 2024
Jinghan Zhang
Xiting Wang
Yiqiao Jin
Changyu Chen
Xinhao Zhang
Kunpeng Liu
    ALM

Papers citing "Prototypical Reward Network for Data-Efficient RLHF"

6 / 6 papers shown
DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
J. Liu
Hangyu Guo
Ranjie Duan
Xingyuan Bu
Yancheng He
...
Yingshui Tan
Yanan Wu
Jihao Gu
Y. Li
J. Zhu
MLLM
79
0
0
25 Apr 2025
Applications of Large Language Model Reasoning in Feature Generation
Dharani Chandra
LRM
63
0
0
15 Mar 2025
Diversity-Oriented Data Augmentation with Large Language Models
Zaitian Wang
Jinghan Zhang
Xinhao Zhang
Kunpeng Liu
Pengfei Wang
Yuanchun Zhou
78
1
0
17 Feb 2025
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
Yiqiao Jin
Mohit Chandra
Gaurav Verma
Yibo Hu
Munmun De Choudhury
Srijan Kumar
LM&MA
ELM
87
65
0
19 Oct 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019