ResearchTrend.AI

Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection

9 May 2023
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Bin Wang
Jiangcheng Zhu
Hao Wang
Shutao Xia
arXiv:2305.05239

Papers citing "Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection"

10 / 10 papers shown
β-DQN: Improving Deep Q-Learning By Evolving the Behavior
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
23
2
0
03 Jan 2025
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference
Ye Li
Chen Tang
Yuan Meng
Jiajun Fan
Zenghao Chai
Xinzhu Ma
Zhi Wang
Wenwu Zhu
31
1
0
06 Jul 2024
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
41
7
0
26 Jun 2024
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
30
0
0
21 Oct 2023
Modeling Task Relationships in Multi-variate Soft Sensor with Balanced Mixture-of-Experts
Yuxin Huang
Hao Wang
Zhaoran Liu
Licheng Pan
Haozhe Li
Xinggao Liu
MoE
27
17
0
25 May 2023
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang
Jiaze Wang
Jinpeng Li
Zixu Zhao
Guangyong Chen
Anfeng Liu
Pheng-Ann Heng
3DPC
14
7
0
12 Mar 2023
TMoE-P: Towards the Pareto Optimum for Multivariate Soft Sensors
Licheng Pan
Hao Wang
Zhichao Chen
Yuxin Huang
Xinggao Liu
10
0
0
21 Feb 2023
AttentionMixer: An Accurate and Interpretable Framework for Process Monitoring
Hao Wang
Zhiyu Wang
Yunlong Niu
Zhaoran Liu
Haozhe Li
Yilin Liao
Yuxin Huang
Xinggao Liu
18
0
0
21 Feb 2023
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
15
2
0
14 Sep 2022