ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.03633
  4. Cited By
Learning Gradient Descent: Better Generalization and Longer Horizons
v1v2v3 (latest)

Learning Gradient Descent: Better Generalization and Longer Horizons

10 March 2017
Kaifeng Lyu
Shunhua Jiang
Jian Li
ArXiv (abs)PDFHTML

Papers citing "Learning Gradient Descent: Better Generalization and Longer Horizons"

50 / 65 papers shown
Enhancing Fractional Gradient Descent with Learned Optimizers
Enhancing Fractional Gradient Descent with Learned Optimizers
Jan Sobotka
Petr Simánek
Pavel Kordík
122
0
0
21 Oct 2025
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Kairun Zhang
Xue Yang
Yanjun Zhao
Yifan Sun
Huan Zhang
193
0
0
01 Oct 2025
Learning from Oblivion: Predicting Knowledge Overflowed Weights via Retrodiction of Forgetting
Learning from Oblivion: Predicting Knowledge Overflowed Weights via Retrodiction of Forgetting
Jinhyeok Jang
Jaehong Kim
Jung Uk Kim
MUKELM
148
0
0
07 Aug 2025
Towards Robust Learning to Optimize with Theoretical Guarantees
Towards Robust Learning to Optimize with Theoretical GuaranteesComputer Vision and Pattern Recognition (CVPR), 2024
Qingyu Song
Wei Lin
Juncheng Wang
Hong Xu
235
3
0
17 Jun 2025
Accelerating Optimization via Differentiable Stopping Time
Accelerating Optimization via Differentiable Stopping Time
Zhonglin Xie
Yiman Fong
Haoran Yuan
Zaiwen Wen
157
0
0
28 May 2025
Efficient End-to-End Learning for Decision-Making: A Meta-Optimization Approach
Efficient End-to-End Learning for Decision-Making: A Meta-Optimization Approach
Rares Cristian
Pavithra Harsha
Georgia Perakis
Brian Quanz
310
1
0
16 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
494
0
0
07 May 2025
Make Optimization Once and for All with Fine-grained Guidance
Make Optimization Once and for All with Fine-grained Guidance
Mingjia Shi
Ruihan Lin
Xuxi Chen
Yuhao Zhou
Zezhen Ding
...
Tong Wang
Xiaojiang Peng
Zhangyang Wang
Jing Zhang
Tianlong Chen
337
1
0
14 Mar 2025
Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning
Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning
Richa Upadhyay
Ronald Phlypo
Rajkumar Saini
Marcus Liwicki
333
0
0
21 Jan 2025
Narrowing the Focus: Learned Optimizers for Pretrained Models
Narrowing the Focus: Learned Optimizers for Pretrained Models
Gus Kristiansen
Mark Sandler
A. Zhmoginov
Nolan Miller
Anirudh Goyal
Jihwan Lee
Max Vladymyrov
348
2
0
17 Aug 2024
Semantic are Beacons: A Semantic Perspective for Unveiling
  Parameter-Efficient Fine-Tuning in Knowledge Learning
Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning
Renzhi Wang
Piji Li
230
5
0
28 May 2024
From Learning to Optimize to Learning Optimization Algorithms
From Learning to Optimize to Learning Optimization Algorithms
Camille Castera
Peter Ochs
497
1
0
28 May 2024
Artificial Intelligence for Operations Research: Revolutionizing the
  Operations Research Process
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
276
22
0
06 Jan 2024
Investigation into the Training Dynamics of Learned Optimizers
Investigation into the Training Dynamics of Learned OptimizersInternational Conference on Agents and Artificial Intelligence (ICAART), 2023
Jan Sobotka
Petr Simánek
Daniel Vasata
273
0
0
12 Dec 2023
Meta-learning Optimizers for Communication-Efficient Learning
Meta-learning Optimizers for Communication-Efficient Learning
Charles-Étienne Joseph
Benjamin Thérien
A. Moudgil
Boris Knyazev
Eugene Belilovsky
452
2
0
02 Dec 2023
Learning to optimize by multi-gradient for multi-objective optimization
Learning to optimize by multi-gradient for multi-objective optimization
Linxi Yang
Xinmin Yang
L. Tang
293
1
0
01 Nov 2023
Is Scaling Learned Optimizers Worth It? Evaluating The Value of VeLO's
  4000 TPU Months
Is Scaling Learned Optimizers Worth It? Evaluating The Value of VeLO's 4000 TPU Months
Fady Rezk
Antreas Antoniou
Henry Gouk
Timothy M. Hospedales
ELM
286
3
0
27 Oct 2023
Deep Model Predictive Optimization
Deep Model Predictive OptimizationIEEE International Conference on Robotics and Automation (ICRA), 2023
Jacob Sacks
Rwik Rana
Kevin Huang
Alex Spitzer
Guanya Shi
Byron Boots
315
16
0
06 Oct 2023
Towards Constituting Mathematical Structures for Learning to Optimize
Towards Constituting Mathematical Structures for Learning to OptimizeInternational Conference on Machine Learning (ICML), 2023
Jialin Liu
Xiaohan Chen
Zinan Lin
W. Yin
HanQin Cai
376
19
0
29 May 2023
Stochastic Unrolled Federated Learning
Stochastic Unrolled Federated Learning
Samar Hadou
Navid Naderializadeh
Alejandro Ribeiro
FedML
391
9
0
24 May 2023
Improving physics-informed neural networks with meta-learned
  optimization
Improving physics-informed neural networks with meta-learned optimizationJournal of machine learning research (JMLR), 2023
Alexander Bihlo
PINN
301
31
0
13 Mar 2023
M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast
  Self-Adaptation
M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-AdaptationInternational Conference on Learning Representations (ICLR), 2023
Junjie Yang
Xuxi Chen
Tianlong Chen
Zinan Lin
Yitao Liang
142
4
0
28 Feb 2023
Learning to Generalize Provably in Learning to Optimize
Learning to Generalize Provably in Learning to OptimizeInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Junjie Yang
Tianlong Chen
Mingkang Zhu
Fengxiang He
Dacheng Tao
Yitao Liang
Zinan Lin
261
11
0
22 Feb 2023
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
493
10
0
03 Feb 2023
Mnemosyne: Learning to Train Transformers with Transformers
Mnemosyne: Learning to Train Transformers with TransformersNeural Information Processing Systems (NeurIPS), 2023
Deepali Jain
K. Choromanski
Kumar Avinava Dubey
Sumeet Singh
Vikas Sindhwani
Tingnan Zhang
Jie Tan
OffRL
569
14
0
02 Feb 2023
Federated Automatic Differentiation
Federated Automatic Differentiation
Keith Rush
Zachary B. Charles
Zachary Garrett
FedML
307
1
0
18 Jan 2023
Learning to Optimize in Model Predictive Control
Learning to Optimize in Model Predictive ControlIEEE International Conference on Robotics and Automation (ICRA), 2022
Jacob Sacks
Byron Boots
316
28
0
05 Dec 2022
Learning to Optimize with Dynamic Mode Decomposition
Learning to Optimize with Dynamic Mode DecompositionIEEE International Joint Conference on Neural Network (IJCNN), 2022
Petr Simánek
Daniel Vasata
Pavel Kordík
183
6
0
29 Nov 2022
VeLO: Training Versatile Learned Optimizers by Scaling Up
VeLO: Training Versatile Learned Optimizers by Scaling Up
Luke Metz
James Harrison
C. Freeman
Amil Merchant
Lucas Beyer
...
Naman Agrawal
Ben Poole
Igor Mordatch
Adam Roberts
Jascha Narain Sohl-Dickstein
358
78
0
17 Nov 2022
Learning to Optimize Quasi-Newton Methods
Learning to Optimize Quasi-Newton Methods
Isaac Liao
Rumen Dangovski
Jakob N. Foerster
Marin Soljacic
300
4
0
11 Oct 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
301
90
0
26 Sep 2022
A Closer Look at Learned Optimization: Stability, Robustness, and
  Inductive Biases
A Closer Look at Learned Optimization: Stability, Robustness, and Inductive BiasesNeural Information Processing Systems (NeurIPS), 2022
James Harrison
Luke Metz
Jascha Narain Sohl-Dickstein
258
33
0
22 Sep 2022
Gradient-based Bi-level Optimization for Deep Learning: A Survey
Gradient-based Bi-level Optimization for Deep Learning: A Survey
Can Chen
Xiangshan Chen
Chen Ma
Zixuan Liu
Xue Liu
580
46
0
24 Jul 2022
Automated Dynamic Algorithm Configuration
Automated Dynamic Algorithm ConfigurationJournal of Artificial Intelligence Research (JAIR), 2022
Steven Adriaensen
André Biedenkapp
Gresa Shala
Noor H. Awad
Theresa Eimer
Marius Lindauer
Katharina Eggensperger
355
50
0
27 May 2022
Practical tradeoffs between memory, compute, and performance in learned
  optimizers
Practical tradeoffs between memory, compute, and performance in learned optimizers
Luke Metz
C. Freeman
James Harrison
Niru Maheswaranathan
Jascha Narain Sohl-Dickstein
502
39
0
22 Mar 2022
Symbolic Learning to Optimize: Towards Interpretability and Scalability
Symbolic Learning to Optimize: Towards Interpretability and ScalabilityInternational Conference on Learning Representations (ICLR), 2022
Wenqing Zheng
Tianlong Chen
Ting-Kuei Hu
Zinan Lin
554
21
0
13 Mar 2022
Optimizer Amalgamation
Optimizer AmalgamationInternational Conference on Learning Representations (ICLR), 2022
Tianshu Huang
Tianlong Chen
Sijia Liu
Shiyu Chang
Lisa Amini
Zinan Lin
MoMe
264
5
0
12 Mar 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
898
80
0
01 Feb 2022
A Simple Guard for Learned Optimizers
A Simple Guard for Learned OptimizersInternational Conference on Machine Learning (ICML), 2022
Isabeau Prémont-Schwarz
Jaroslav Vítkru
Jan Feyereisl
392
10
0
28 Jan 2022
ModelPred: A Framework for Predicting Trained Model from Training Data
ModelPred: A Framework for Predicting Trained Model from Training Data
Yingyan Zeng
Jiachen T. Wang
Si-An Chen
H. Just
Ran Jin
R. Jia
TDIMU
295
5
0
24 Nov 2021
Self-Learning Tuning for Post-Silicon Validation
Self-Learning Tuning for Post-Silicon Validation
P. Domanski
Dirk Pflüger
J. Rivoir
Raphael Latty
125
6
0
17 Nov 2021
Gradients are Not All You Need
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
296
104
0
10 Nov 2021
Efficient Meta Subspace Optimization
Efficient Meta Subspace Optimization
Yoni Choukroun
Michael Katz
215
1
0
28 Oct 2021
Model-Agnostic Meta-Attack: Towards Reliable Evaluation of Adversarial
  Robustness
Model-Agnostic Meta-Attack: Towards Reliable Evaluation of Adversarial Robustness
Xiao Yang
Yinpeng Dong
Wenzhao Xiang
Tianyu Pang
Hang Su
Jun Zhu
AAML
146
4
0
13 Oct 2021
Bootstrapped Meta-Learning
Bootstrapped Meta-LearningInternational Conference on Learning Representations (ICLR), 2021
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
194
61
0
09 Sep 2021
Learn2Hop: Learned Optimization on Rough Landscapes
Learn2Hop: Learned Optimization on Rough LandscapesInternational Conference on Machine Learning (ICML), 2021
Amil Merchant
Luke Metz
S. Schoenholz
E. D. Cubuk
246
19
0
20 Jul 2021
Learning to Optimize: A Primer and A Benchmark
Learning to Optimize: A Primer and A BenchmarkJournal of machine learning research (JMLR), 2021
Tianlong Chen
Xiaohan Chen
Wuyang Chen
Howard Heaton
Jialin Liu
Zinan Lin
W. Yin
681
316
0
23 Mar 2021
Reverse engineering learned optimizers reveals known and novel
  mechanisms
Reverse engineering learned optimizers reveals known and novel mechanisms
Niru Maheswaranathan
David Sussillo
Luke Metz
Ruoxi Sun
Jascha Narain Sohl-Dickstein
385
26
0
04 Nov 2020
Training Stronger Baselines for Learning to Optimize
Training Stronger Baselines for Learning to OptimizeNeural Information Processing Systems (NeurIPS), 2020
Tianlong Chen
Weiyi Zhang
Jingyang Zhou
Shiyu Chang
Sijia Liu
Lisa Amini
Zinan Lin
OffRL
298
59
0
18 Oct 2020
Tasks, stability, architecture, and compute: Training more effective
  learned optimizers, and using them to train themselves
Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Luke Metz
Niru Maheswaranathan
C. Freeman
Ben Poole
Jascha Narain Sohl-Dickstein
384
70
0
23 Sep 2020
12
Next
Page 1 of 2