Frequency-based Search-control in Dyna

International Conference on Learning Representations (ICLR), 2020
14 February 2020
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand

Papers citing "Frequency-based Search-control in Dyna"

9 / 9 papers shown
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch
IEEE International Conference on Robotics and Automation (ICRA), 2023
Zhengrong Xue
H. Zhang
Jin Cheng
Zhengmao He
Yuanchen Ju
Chan-Yu Lin
Gu Zhang
Huazhe Xu
OffRL
517
20
0
20 Feb 2025
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
International Conference on Learning Representations (ICLR), 2024
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
481
12
0
11 Oct 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
282
0
0
27 Jun 2024
Curious Replay for Model-based Adaptation
International Conference on Machine Learning (ICML), 2023
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
184
18
0
28 Jun 2023
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
586
9
0
22 May 2022
Self-Consistent Models and Values
Neural Information Processing Systems (NeurIPS), 2021
Gregory Farquhar
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
268
9
0
25 Oct 2021
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Conference on Uncertainty in Artificial Intelligence (UAI), 2020
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand
Martha White
Hengshuai Yao
Mohsen Rohani
Jun Luo
450
22
0
19 Jul 2020
Learning to Sample with Local and Global Contexts in Experience Replay Buffer
International Conference on Learning Representations (ICLR), 2020
Youngmin Oh
Kimin Lee
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
151
19
0
14 Jul 2020
An implicit function learning approach for parametric modal regression
Neural Information Processing Systems (NeurIPS), 2020
Yangchen Pan
Ehsan Imani
Martha White
Amir-massoud Farahmand
303
13
0
14 Feb 2020