Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse


22 May 2025
Josh Alman
Zhao Song

Papers citing "Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse"

18 / 18 papers shown
Fundamental Limits of Crystalline Equivariant Graph Neural Networks: A Circuit Complexity Perspective
Yang Cao
Zhao Song
Jiahao Zhang
Jiale Zhao
139
0
0
07 Oct 2025
Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
Xuyang Guo
Zekai Huang
Zhenmei Shi
Zhao Song
Jiahao Zhang
CoGe, VLM
262
4
0
06 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
128
0
0
04 Oct 2025
Too Easily Fooled? Prompt Injection Breaks LLMs on Frustratingly Simple Multiple-Choice Questions
Xuyang Guo
Zekai Huang
Zhao Song
Jiahao Zhang
LRM
136
3
0
16 Aug 2025
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
Yang Cao
Yubin Chen
Zhao Song
Jiahao Zhang
155
7
0
09 Aug 2025
Subquadratic Algorithms and Hardness for Attention with Any Temperature
Shreya Gupta
Boyang Huang
Barna Saha
Yinzhan Xu
Christopher Ye
247
2
0
20 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
308
23
0
17 May 2025
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
VGen
1.0K
8
0
08 May 2025
Always Skip Attention
Yiping Ji
Hemanth Saratchandran
Peyman Moghaddam
Simon Lucey
1.0K
6
0
04 May 2025
T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
EGVM, VGen, PINN
432
21
0
01 May 2025
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Xuyang Guo
Zekai Huang
Jiayan Huo
Yingyu Liang
Zhenmei Shi
Zhao Song
Jiahao Zhang
ALM, VGen
450
13
0
05 Apr 2025
When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time?
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Chenyang Li
Yingyu Liang
Zhenmei Shi
Zhao Song
204
6
0
24 Feb 2025
Fast Gradient Computation for RoPE Attention in Almost Linear Time
Yifang Chen
Jiayan Huo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
306
19
0
03 Jan 2025
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Yibo Wen
Chenwei Xu
Jerry Yao-Chieh Hu
Han Liu
DiffM
242
6
0
30 Dec 2024
One Step Diffusion via Shortcut Models
International Conference on Learning Representations (ICLR), 2024
Kevin Frans
Danijar Hafner
Sergey Levine
Pieter Abbeel
VLM, DiffM
613
165
0
16 Oct 2024
HSR-Enhanced Sparse Attention Acceleration
Bo Chen
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao Song
638
24
0
14 Oct 2024
Binary Hypothesis Testing for Softmax Models and Leverage Score Models
Yeqi Gao
Yuzhou Gu
Zhao Song
372
1
0
09 May 2024
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
290
7
0
30 Oct 2023