ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.04821
  4. Cited By
Evolved Policy Gradients
v1v2 (latest)

Evolved Policy Gradients

13 February 2018
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Evolved Policy Gradients"

50 / 158 papers shown
Meta knowledge assisted Evolutionary Neural Architecture Search
Meta knowledge assisted Evolutionary Neural Architecture Search
Yangyang Li
Guanlong Liu
Ronghua Shang
L. Jiao
217
0
0
30 Apr 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
442
0
0
25 Jan 2025
Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel
  Bidding
Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel BiddingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2024
Shenghong He
Chao Yu
290
3
0
26 Dec 2024
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam
  Timesteps
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam TimestepsNeural Information Processing Systems (NeurIPS), 2024
Benjamin Ellis
Matthew Jackson
Andrei Lupu
Alexander David Goldie
Mattie Fellows
Shimon Whiteson
Jakob Foerster
368
7
0
22 Dec 2024
Task-driven Image Fusion with Learnable Fusion Loss
Task-driven Image Fusion with Learnable Fusion LossComputer Vision and Pattern Recognition (CVPR), 2024
Haowen Bai
Jiangshe Zhang
Zixiang Zhao
Yichen Wu
Lilun Deng
Yukun Cui
Tao Feng
Shuang Xu
545
22
0
04 Dec 2024
Black box meta-learning intrinsic rewards for sparse-reward environments
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
308
1
0
31 Jul 2024
Behaviour Distillation
Behaviour Distillation
Andrei Lupu
Chris Xiaoxuan Lu
Jarek Liesen
R. T. Lange
Jakob Foerster
DD
277
8
0
21 Jun 2024
EvIL: Evolution Strategies for Generalisable Imitation Learning
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora
Gokul Swamy
Chris Xiaoxuan Lu
Yee Whye Teh
Jakob Nicolaus Foerster
209
9
0
15 Jun 2024
Discovering Preference Optimization Algorithms with and for Large
  Language Models
Discovering Preference Optimization Algorithms with and for Large Language Models
Chris Xiaoxuan Lu
Samuel Holt
Claudio Fanconi
Alex J. Chan
Jakob Foerster
M. Schaar
R. T. Lange
OffRL
325
29
0
12 Jun 2024
Preparing for Black Swans: The Antifragility Imperative for Machine
  Learning
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
340
7
0
18 May 2024
Fast and Efficient Local Search for Genetic Programming Based Loss
  Function Learning
Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning
Christian Raymond
Qi Chen
Bing Xue
Mengjie Zhang
353
4
0
01 Mar 2024
Evolutionary Reinforcement Learning: A Systematic Review and Future
  Directions
Evolutionary Reinforcement Learning: A Systematic Review and Future Directions
Y. Lin
Fan Lin
Guorong Cai
Hong Chen
Lixin Zou
Pengcheng Wu
221
8
0
20 Feb 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew Jackson
Chris Xiaoxuan Lu
Louis Kirsch
R. T. Lange
Shimon Whiteson
Jakob N. Foerster
325
21
0
08 Feb 2024
Learning mirror maps in policy mirror descent
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
302
2
0
07 Feb 2024
ReFusion: Learning Image Fusion from Reconstruction with Learnable Loss via Meta-Learning
ReFusion: Learning Image Fusion from Reconstruction with Learnable Loss via Meta-LearningInternational Journal of Computer Vision (IJCV), 2023
Haowen Bai
Zixiang Zhao
Jiangshe Zhang
Yichen Wu
Lilun Deng
Yukun Cui
Shuang Xu
Baisong Jiang
421
36
0
13 Dec 2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Annie S. Chen
Govind Chada
Laura M. Smith
Archit Sharma
Zipeng Fu
Sergey Levine
Chelsea Finn
512
9
0
02 Nov 2023
A Survey on Knowledge Editing of Neural Networks
A Survey on Knowledge Editing of Neural NetworksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
428
41
0
30 Oct 2023
Deep Model Predictive Optimization
Deep Model Predictive OptimizationIEEE International Conference on Robotics and Automation (ICRA), 2023
Jacob Sacks
Rwik Rana
Kevin Huang
Alex Spitzer
Guanya Shi
Byron Boots
256
16
0
06 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial
  Environment Design
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignNeural Information Processing Systems (NeurIPS), 2023
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
245
19
0
04 Oct 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
AdaptNet: Policy Adaptation for Physics-Based Character ControlACM Transactions on Graphics (TOG), 2023
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
455
29
0
30 Sep 2023
Diagnosing and exploiting the computational demands of videos games for
  deep reinforcement learning
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
184
0
0
22 Sep 2023
Fine-grained Recognition with Learnable Semantic Data Augmentation
Fine-grained Recognition with Learnable Semantic Data AugmentationIEEE Transactions on Image Processing (IEEE TIP), 2023
Yifan Pu
Yizeng Han
Yulin Wang
Junlan Feng
Chao Deng
Gao Huang
325
53
0
01 Sep 2023
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel
  Optimization
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel OptimizationEuropean Conference on Artificial Intelligence (ECAI), 2023
Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chun-Han Chen
OffRL
159
1
0
01 Aug 2023
Acceleration in Policy Optimization
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
339
0
0
18 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal
  Summarization in Online Domains
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
189
0
0
17 Jun 2023
Fast Context Adaptation in Cost-Aware Continual Learning
Fast Context Adaptation in Cost-Aware Continual LearningIEEE Transactions on Machine Learning in Communications and Networking (IEEE TMLCN), 2023
Seyyidahmed Lahmer
Federico Mason
Federico Chiariotti
Andrea Zanella
194
3
0
06 Jun 2023
Efficient automatic design of robots
Efficient automatic design of robotsProceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
David Matthews
Andrew Spielberg
Daniela Rus
Sam Kriegman
Josh Bongard
174
35
0
05 Jun 2023
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for
  Meta-Learning
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-Learning
Jun Shu
Xiang Yuan
Deyu Meng
Zongben Xu
335
5
0
13 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
303
4
0
11 May 2023
Structured State Space Models for In-Context Reinforcement Learning
Structured State Space Models for In-Context Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
554
134
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Evolutionary Reinforcement Learning: A SurveyIntelligent Computing (IC), 2023
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
480
85
0
07 Mar 2023
Unsupervised Meta-Learning via Few-shot Pseudo-supervised Contrastive
  Learning
Unsupervised Meta-Learning via Few-shot Pseudo-supervised Contrastive LearningInternational Conference on Learning Representations (ICLR), 2023
Huiwon Jang
Hankook Lee
Jinwoo Shin
VLMSSL
228
27
0
02 Mar 2023
Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid
  Learning in RNNs
Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid Learning in RNNsInternational Conference on Learning Representations (ICLR), 2023
Y. Duan
Zhongfan Jia
Qian Li
Yi Zhong
Kaisheng Ma
AAML
233
4
0
07 Feb 2023
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
455
10
0
03 Feb 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
217
2
0
26 Jan 2023
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement
  Learning
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Tairan He
Weiye Zhao
Changliu Liu
OffRL
225
21
0
24 Jan 2023
General-Purpose In-Context Learning by Meta-Learning Transformers
General-Purpose In-Context Learning by Meta-Learning Transformers
Louis Kirsch
James Harrison
Jascha Narain Sohl-Dickstein
Luke Metz
436
107
0
08 Dec 2022
Implicit Training of Energy Model for Structure Prediction
Implicit Training of Energy Model for Structure Prediction
Shiv Shankar
Vihari Piratla
214
0
0
21 Nov 2022
Simple Emergent Action Representations from Multi-Task Policy Training
Simple Emergent Action Representations from Multi-Task Policy TrainingInternational Conference on Learning Representations (ICLR), 2022
Pu Hua
Yubei Chen
Huazhe Xu
MLAU
206
7
0
18 Oct 2022
Reinforcement Learning with Automated Auxiliary Loss Search
Reinforcement Learning with Automated Auxiliary Loss SearchNeural Information Processing Systems (NeurIPS), 2022
Tairan He
Yuge Zhang
Kan Ren
Minghuan Liu
Che Wang
Weinan Zhang
Yuqing Yang
Dongsheng Li
295
18
0
12 Oct 2022
Discovered Policy Optimisation
Discovered Policy OptimisationNeural Information Processing Systems (NeurIPS), 2022
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
364
113
0
11 Oct 2022
Learning Symbolic Model-Agnostic Loss Functions via Meta-Learning
Learning Symbolic Model-Agnostic Loss Functions via Meta-LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Christian Raymond
Qi Chen
Bing Xue
Mengjie Zhang
FedML
258
17
0
19 Sep 2022
Learning to learn online with neuromodulated synaptic plasticity in
  spiking neural networks
Learning to learn online with neuromodulated synaptic plasticity in spiking neural networksbioRxiv (bioRxiv), 2022
Samuel Schmidgall
Joe Hays
331
3
0
25 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via
  Contrastive Learning
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive LearningInternational Conference on Machine Learning (ICML), 2022
Haoqi Yuan
Zongqing Lu
SSLOffRL
231
51
0
21 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRLLRM
369
156
0
19 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
On the Effectiveness of Fine-tuning Versus Meta-reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
423
38
0
07 Jun 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications,
  Challenges, and Opportunities
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and OpportunitiesACM Computing Surveys (ACM CSUR), 2022
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
392
618
0
13 May 2022
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and
  Stability
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
251
0
0
08 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
113
0
0
05 Apr 2022
Meta-Reinforcement Learning with Self-Modifying Networks
Meta-Reinforcement Learning with Self-Modifying NetworksNeural Information Processing Systems (NeurIPS), 2022
Mathieu Chalvidal
Thomas Serre
Rufin VanRullen
KELM
249
8
0
04 Feb 2022
1234
Next
Page 1 of 4