ResearchTrend.AI
Improving Convergence and Generalization Using Parameter Symmetries


22 May 2023
Bo-Lu Zhao
Robert Mansel Gower
Robin G. Walters
Rose Yu
    MoMe

Papers citing "Improving Convergence and Generalization Using Parameter Symmetries"

15 / 15 papers shown
Title
Improving Learning to Optimize Using Parameter Symmetries
Guy Zamir
Aryan Dokania
B. Zhao
Rose Yu
17
0
0
21 Apr 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion
Rana Muhammad Shahroz Khan
Dongwen Tang
Pingzhi Li
Kai Wang
Tianlong Chen
AI4CE
59
0
0
31 Mar 2025
Continual Optimization with Symmetry Teleportation for Multi-Task Learning
Zhipeng Zhou
Ziqiao Meng
Pengcheng Wu
Peilin Zhao
Chunyan Miao
47
0
0
06 Mar 2025
Parameter Symmetry Breaking and Restoration Determines the Hierarchical Learning in AI Systems
Liu Ziyin
Yizhou Xu
T. Poggio
Isaac Chuang
48
4
0
07 Feb 2025
Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion
Binchi Zhang
Zaiyi Zheng
Zhengzhang Chen
Jundong Li
52
0
0
01 Feb 2025
Remove Symmetries to Control Model Expressivity and Improve Optimization
Liu Ziyin
Yizhou Xu
Isaac Chuang
AAML
38
1
0
28 Aug 2024
Scale Equivariant Graph Metanetworks
Ioannis Kalogeropoulos
Giorgos Bouritsas
Yannis Panagakis
42
6
0
15 Jun 2024
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
Yi-Fan Zhang
Qingsong Wen
Chaoyou Fu
Xue Wang
Zhang Zhang
L. Wang
Rong Jin
34
40
0
12 Jun 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Derek Lim
Moe Putterman
Robin Walters
Haggai Maron
Stefanie Jegelka
35
5
0
30 May 2024
Level Set Teleportation: An Optimization Perspective
Aaron Mishkin
A. Bietti
Robert Mansel Gower
28
1
0
05 Mar 2024
Functional dimension of feedforward ReLU neural networks
J. E. Grigsby
Kathryn A. Lindsey
R. Meyerhoff
Chen-Chun Wu
27
11
0
08 Sep 2022
Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning Dynamics
D. Kunin
Javier Sagastuy-Breña
Surya Ganguli
Daniel L. K. Yamins
Hidenori Tanaka
99
77
0
08 Dec 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
243
11,568
0
09 Mar 2017
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,878
0
15 Sep 2016
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark W. Schmidt
119
1,190
0
16 Aug 2016