ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.07198
  4. Cited By
Learning to Generalize from Sparse and Underspecified Rewards

Learning to Generalize from Sparse and Underspecified Rewards

19 February 2019
Rishabh Agarwal
Chen Liang
Dale Schuurmans
Mohammad Norouzi
    OffRL
ArXivPDFHTML

Papers citing "Learning to Generalize from Sparse and Underspecified Rewards"

19 / 19 papers shown
Title
Reinforcement Learning from Multi-level and Episodic Human Feedback
Reinforcement Learning from Multi-level and Episodic Human Feedback
Muhammad Qasim Elahi
Somtochukwu Oguchienti
Maheed H. Ahmed
Mahsa Ghasemi
OffRL
50
0
0
20 Apr 2025
Learning Autonomous Code Integration for Math Language Models
Learning Autonomous Code Integration for Math Language Models
Haozhe Wang
Long Li
C. Qu
Fengming Zhu
Weidi Xu
Wei Chu
Fangzhen Lin
56
1
0
02 Feb 2025
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction
  Tuning
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
Qian Liu
Fan Zhou
Zhengbao Jiang
Longxu Dou
Min-Bin Lin
18
17
0
17 Apr 2023
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary
  3D Environment
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment
Xuyang Li
Jianwu Fang
Kai Du
K. Mei
Jianru Xue
16
6
0
07 Apr 2023
Natural Language-conditioned Reinforcement Learning with Inside-out Task
  Language Development and Translation
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation
Jing-Cheng Pang
Xinyi Yang
Sibei Yang
Yang Yu
29
8
0
18 Feb 2023
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice
  Synthesis
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Yinjiao Lei
Shan Yang
Xinsheng Wang
Qicong Xie
Jixun Yao
Linfu Xie
Dan Su
DiffM
21
8
0
03 Dec 2022
Large Language Models are few(1)-shot Table Reasoners
Large Language Models are few(1)-shot Table Reasoners
Wenhu Chen
LMTD
ReLM
LRM
22
138
0
13 Oct 2022
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic Languages
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
122
198
0
06 Oct 2022
On The Ingredients of an Effective Zero-shot Semantic Parser
On The Ingredients of an Effective Zero-shot Semantic Parser
Pengcheng Yin
John Wieting
Avirup Sil
Graham Neubig
50
15
0
15 Oct 2021
SPARQLing Database Queries from Intermediate Question Decompositions
SPARQLing Database Queries from Intermediate Question Decompositions
Irina Saparina
A. Osokin
21
14
0
13 Sep 2021
TAPEX: Table Pre-training via Learning a Neural SQL Executor
TAPEX: Table Pre-training via Learning a Neural SQL Executor
Qian Liu
Bei Chen
Jiaqi Guo
Morteza Ziyadi
Zeqi Lin
Weizhu Chen
Jian-Guang Lou
LMTD
25
259
0
16 Jul 2021
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query
  Language
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language
Ning Li
Bethany Keller
M. Butler
Daniel Cer
18
8
0
07 Nov 2020
On the Potential of Lexico-logical Alignments for Semantic Parsing to
  SQL Queries
On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries
Tianze Shi
Chen Zhao
Jordan L. Boyd-Graber
Hal Daumé
Lillian Lee
32
78
0
21 Oct 2020
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
Tao Yu
Chien-Sheng Wu
Xi Lin
Bailin Wang
Y. Tan
Xinyi Yang
Dragomir R. Radev
R. Socher
Caiming Xiong
LMTD
30
247
0
29 Sep 2020
Guiding Deep Molecular Optimization with Genetic Exploration
Guiding Deep Molecular Optimization with Genetic Exploration
Sungsoo Ahn
Junsup Kim
Hankook Lee
Jinwoo Shin
29
70
0
04 Jul 2020
TAPAS: Weakly Supervised Table Parsing via Pre-training
TAPAS: Weakly Supervised Table Parsing via Pre-training
Jonathan Herzig
Pawel Krzysztof Nowak
Thomas Müller
Francesco Piccinno
Julian Martin Eisenschlos
LMTD
RALM
33
633
0
05 Apr 2020
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse
  Rewards
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
29
4
0
21 Dec 2019
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
362
11,700
0
09 Mar 2017
Simpler Context-Dependent Logical Forms via Model Projections
Simpler Context-Dependent Logical Forms via Model Projections
R. Long
Panupong Pasupat
Percy Liang
204
102
0
16 Jun 2016
1