ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.12750
  4. Cited By
Learning Multimodal Rewards from Rankings

Learning Multimodal Rewards from Rankings

27 September 2021
Vivek Myers
Erdem Biyik
Nima Anari
Dorsa Sadigh
    OffRL
ArXivPDFHTML

Papers citing "Learning Multimodal Rewards from Rankings"

10 / 10 papers shown
Title
Learning to Assist Humans without Inferring Rewards
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
30
2
0
17 Jan 2025
Teaching Language Models to Self-Improve by Learning from Language
  Feedback
Teaching Language Models to Self-Improve by Learning from Language Feedback
Chi Hu
Yimin Hu
Hang Cao
Tong Xiao
Jingbo Zhu
LRM
VLM
25
4
0
11 Jun 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024
A Generalized Acquisition Function for Preference-based Reward Learning
A Generalized Acquisition Function for Preference-based Reward Learning
Evan Ellis
Gaurav R. Ghosal
Stuart J. Russell
Anca Dragan
Erdem Biyik
29
1
0
09 Mar 2024
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous
  Manipulation
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu
Aaron Rovinsky
Jianlan Luo
Vikash Kumar
Abhishek Gupta
Sergey Levine
OffRL
14
9
0
06 Sep 2023
Active Inverse Learning in Stackelberg Trajectory Games
Active Inverse Learning in Stackelberg Trajectory Games
Yue Yu
Jacob Levy
Negar Mehr
David Fridovich-Keil
Ufuk Topcu
11
2
0
15 Aug 2023
Probabilistic Conformal Prediction Using Conditional Random Samples
Probabilistic Conformal Prediction Using Conditional Random Samples
Zhendong Wang
Ruijiang Gao
Mingzhang Yin
Mingyuan Zhou
David M. Blei
TPM
27
22
0
14 Jun 2022
Learning from Imperfect Demonstrations via Adversarial Confidence
  Transfer
Learning from Imperfect Demonstrations via Adversarial Confidence Transfer
Zhangjie Cao
Zihan Wang
Dorsa Sadigh
AAML
19
7
0
07 Feb 2022
Preference-Based Learning for Exoskeleton Gait Optimization
Preference-Based Learning for Exoskeleton Gait Optimization
Maegan Tucker
Ellen R. Novoseller
Claudia K. Kann
Yanan Sui
Yisong Yue
J. W. Burdick
Aaron D. Ames
66
89
0
26 Sep 2019
Early Detection of Combustion Instabilities using Deep Convolutional
  Selective Autoencoders on Hi-speed Flame Video
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video
Chandrayee Basu
Qian Yang
M. Singhal
Anca Dragan
49
174
0
25 Mar 2016
1