arXiv:2305.13404
Improving Convergence and Generalization Using Parameter Symmetries
22 May 2023
Bo Zhao
Robert Mansel Gower
Robin G. Walters
Rose Yu
MoMe
Papers citing "Improving Convergence and Generalization Using Parameter Symmetries"
15 / 15 papers shown
Title
Improving Learning to Optimize Using Parameter Symmetries
Guy Zamir
Aryan Dokania
B. Zhao
Rose Yu
17
0
0
21 Apr 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion
Rana Muhammad Shahroz Khan
Dongwen Tang
Pingzhi Li
Kai Wang
Tianlong Chen
AI4CE
59
0
0
31 Mar 2025
Continual Optimization with Symmetry Teleportation for Multi-Task Learning
Zhipeng Zhou
Ziqiao Meng
Pengcheng Wu
Peilin Zhao
Chunyan Miao
47
0
0
06 Mar 2025
Parameter Symmetry Breaking and Restoration Determines the Hierarchical Learning in AI Systems
Liu Ziyin
Yizhou Xu
T. Poggio
Isaac Chuang
48
4
0
07 Feb 2025
Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion
Binchi Zhang
Zaiyi Zheng
Zhengzhang Chen
Jundong Li
52
0
0
01 Feb 2025
Remove Symmetries to Control Model Expressivity and Improve Optimization
Liu Ziyin
Yizhou Xu
Isaac Chuang
AAML
38
1
0
28 Aug 2024
Scale Equivariant Graph Metanetworks
Ioannis Kalogeropoulos
Giorgos Bouritsas
Yannis Panagakis
42
6
0
15 Jun 2024
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
Yi-Fan Zhang
Qingsong Wen
Chaoyou Fu
Xue Wang
Zhang Zhang
L. Wang
Rong Jin
34
40
0
12 Jun 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Derek Lim
Moe Putterman
Robin Walters
Haggai Maron
Stefanie Jegelka
35
5
0
30 May 2024
Level Set Teleportation: An Optimization Perspective
Aaron Mishkin
A. Bietti
Robert Mansel Gower
28
1
0
05 Mar 2024
Functional dimension of feedforward ReLU neural networks
J. E. Grigsby
Kathryn A. Lindsey
R. Meyerhoff
Chen-Chun Wu
27
11
0
08 Sep 2022
Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning Dynamics
D. Kunin
Javier Sagastuy-Breña
Surya Ganguli
Daniel L. K. Yamins
Hidenori Tanaka
99
77
0
08 Dec 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
243
11,568
0
09 Mar 2017
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,878
0
15 Sep 2016
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark W. Schmidt
119
1,190
0
16 Aug 2016