REvolve: Reward Evolution with Large Language Models using Human Feedback


3 June 2024
Rishi Hazra
Alkis Sygkounas
A. Persson
Amy Loutfi
Pedro Zuidberg Dos Martires

Papers citing "REvolve: Reward Evolution with Large Language Models using Human Feedback"

11 / 11 papers shown
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
A. Persson
M. Felsberg
Amy Loutfi
OffRL
28
0
0
28 Apr 2025
Urban Computing in the Era of Large Language Models
Zhonghang Li
Lianghao Xia
Xubin Ren
J. Tang
Tianyi Chen
Yong-mei Xu
C. Huang
73
0
0
02 Apr 2025
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma
Niclas Boehmer
Lingkai Kong
Milind Tambe
69
2
0
17 Jan 2025
DrEureka: Language Model Guided Sim-To-Real Transfer
Yecheng Jason Ma
William Liang
Hung-Ju Wang
Sam Wang
Yuke Zhu
Linxi Fan
Osbert Bastani
Dinesh Jayaraman
71
39
0
04 Jun 2024
Large Language Models to Enhance Bayesian Optimization
Tennison Liu
Nicolás Astorga
Nabeel Seedat
M. Schaar
58
45
0
06 Feb 2024
NEWTON: Are Large Language Models Capable of Physical Reasoning?
Yi Ru Wang
Jiafei Duan
Dieter Fox
S. Srinivasa
ELM
LRM
AIMat
ReLM
62
22
0
10 Oct 2023
How Simulation Helps Autonomous Driving: A Survey of Sim2real, Digital Twins, and Parallel Intelligence
Xuemin Hu
Shen Li
Ting Huang
Bo Tang
Rouxing Huai
Long Chen
44
64
0
02 May 2023
Instruction Tuning with GPT-4
Baolin Peng
Chunyuan Li
Pengcheng He
Michel Galley
Jianfeng Gao
SyDa
ALM
LM&MA
157
579
0
06 Apr 2023
SORNet: Spatial Object-Centric Representations for Sequential Manipulation
Wentao Yuan
Chris Paxton
Karthik Desingh
D. Fox
3DPC
137
72
0
08 Sep 2021
Reward (Mis)design for Autonomous Driving
W. B. Knox
A. Allievi
Holger Banzhaf
Felix Schmitt
Peter Stone
67
112
0
28 Apr 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
267
1,808
0
14 Dec 2020