Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse

22 May 2025
Josh Alman
Zhao Song

Papers citing "Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse"

18 / 18 papers shown
Fundamental Limits of Crystalline Equivariant Graph Neural Networks: A Circuit Complexity Perspective
Yang Cao
Zhao Song
Jiahao Zhang
Jiale Zhao
150
1
0
07 Oct 2025
Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
Xuyang Guo
Zekai Huang
Zhenmei Shi
Zhao Song
Jiahao Zhang
CoGe
VLM
287
4
0
06 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
147
0
0
04 Oct 2025
Too Easily Fooled? Prompt Injection Breaks LLMs on Frustratingly Simple Multiple-Choice Questions
Xuyang Guo
Zekai Huang
Zhao Song
Jiahao Zhang
LRM
144
3
0
16 Aug 2025
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
Yang Cao
Yubin Chen
Zhao Song
Jiahao Zhang
177
7
0
09 Aug 2025
Subquadratic Algorithms and Hardness for Attention with Any Temperature
Shreya Gupta
Boyang Huang
Barna Saha
Yinzhan Xu
Christopher Ye
265
2
0
20 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
349
23
0
17 May 2025
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
VGen
1.1K
8
0
08 May 2025
Always Skip Attention
Yiping Ji
Hemanth Saratchandran
Peyman Moghaddam
Simon Lucey
1.1K
6
0
04 May 2025
T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
EGVM
VGen
PINN
484
22
0
01 May 2025
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Xuyang Guo
Zekai Huang
Jiayan Huo
Yingyu Liang
Zhenmei Shi
Zhao Song
Jiahao Zhang
ALM
VGen
503
13
0
05 Apr 2025
When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time?
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Chenyang Li
Yingyu Liang
Zhenmei Shi
Zhao Song
245
6
0
24 Feb 2025
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Yibo Wen
Chenwei Xu
Jerry Yao-Chieh Hu
Han Liu
Han Liu
DiffM
275
6
0
30 Dec 2024
RoPE Attention Can Be Trained in Almost Linear Time
RoPE Attention Can Be Trained in Almost Linear Time
Yifang Chen
Jiayan Huo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
354
19
0
23 Dec 2024
One Step Diffusion via Shortcut Models
International Conference on Learning Representations (ICLR), 2024
Kevin Frans
Danijar Hafner
Sergey Levine
Pieter Abbeel
VLM
DiffM
650
175
0
16 Oct 2024
HSR-Enhanced Sparse Attention Acceleration
Bo Chen
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao Song
818
24
0
14 Oct 2024
Binary Hypothesis Testing for Softmax Models and Leverage Score Models
Yeqi Gao
Yuzhou Gu
Zhao Song
413
1
0
09 May 2024
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
320
7
0
30 Oct 2023