ResearchTrend.AI
From "Thumbs Up" to "10 out of 10": Reconsidering Scalar Feedback in Interactive Reinforcement Learning

17 November 2023
Hang Yu
Reuben M. Aronson
Katherine H. Allen
E. Short

Papers citing "From "Thumbs Up" to "10 out of 10": Reconsidering Scalar Feedback in Interactive Reinforcement Learning"

3 / 3 papers shown
Enhancing Preference-based Linear Bandits via Human Response Time
Shen Li
Yuyang Zhang
Zhaolin Ren
Claire Liang
Na Li
J. Shah
34
0
0
03 Jan 2025
How Much Progress Did I Make? An Unexplored Human Feedback Signal for Teaching Robots
Hang Yu
Qidi Fang
Shijie Fang
Reuben M. Aronson
E. Short
20
0
0
08 Jul 2024
Self-Initiated Open World Learning for Autonomous AI Agents
Bing-Quan Liu
Eric Robertson
Scott Grigsby
Sahisnu Mazumder
AI4CE
30
8
0
21 Oct 2021