1912.12180
Axial Attention in Multidimensional Transformers
20 December 2019
Jonathan Ho
Nal Kalchbrenner
Dirk Weissenborn
Tim Salimans
Papers citing
"Axial Attention in Multidimensional Transformers"
50 / 314 papers shown
Title
EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage Prediction
Xihe Qiu
Gengchen Ma
Haoyu Wang
Chen Zhan
Xiaoyu Tan
Shuo Li
VLM
99
0
0
24 Nov 2025
Predicting partially observable dynamical systems via diffusion models with a multiscale inference scheme
Rudy Morel
Francesco Pio Ramunno
Jeff Shen
Alberto Bietti
Kyunghyun Cho
...
François Rozet
K. Leka
F. Lanusse
David Fouhey
Shirley Ho
DiffM
AI4CE
329
0
0
24 Nov 2025
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Michael McCabe
Payel Mukhopadhyay
Tanya Marwah
Bruno Régaldo-Saint Blancard
François Rozet
...
Mariel Pettee
Jeff Shen
Kyunghyun Cho
M. Cranmer
S. Ho
AI4CE
156
0
0
19 Nov 2025
EMOD: A Unified EEG Emotion Representation Framework Leveraging V-A Guided Contrastive Learning
Yuning Chen
Sha Zhao
Shijian Li
Gang Pan
46
0
0
08 Nov 2025
Order-Level Attention Similarity Across Language Models: A Latent Commonality
Jinglin Liang
Jin Zhong
Shuangping Huang
Yunqing Hu
Huiyuan Zhang
Huifang Li
Lixin Fan
Hanlin Gu
64
0
0
07 Nov 2025
Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase
Mihir Mahajan
Alfred Nguyen
Franz Srambical
Stefan Bauer
144
0
0
30 Oct 2025
Triangle Multiplication Is All You Need For Biomolecular Structure Representations
Jeffrey Ouyang-Zhang
Pranav Murugan
Daniel J. Diaz
Gianluca Scarpellini
Richard Strong Bowen
Nate Gruver
Adam Klivans
Philipp Krahenbuhl
Aleksandra Faust
Maruan Al-Shedivat
180
1
0
21 Oct 2025
Context-Selective State Space Models: Feedback is All You Need
Riccardo Zattra
Giacomo Baggio
Umberto Casti
Augusto Ferrante
Francesco Ticozzi
Mamba
108
0
0
15 Oct 2025
GapDNER: A Gap-Aware Grid Tagging Model for Discontinuous Named Entity Recognition
Yawen Yang
Fukun Ma
Shiao Meng
Aiwei Liu
Lijie Wen
68
0
0
13 Oct 2025
Adaptive Stain Normalization for Cross-Domain Medical Histology
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Tianyue Xu
Yanlin Wu
Abhai K. Tripathi
Matthew M. Ippolito
Benjamin D. Haeffele
OOD
MedIm
76
0
0
08 Oct 2025
Probability calibration for precipitation nowcasting
Lauri Kurki
Yaniel Cabrera
Samu Karanko
69
0
0
01 Oct 2025
Generalized Parallel Scaling with Interdependent Generations
Harry Dong
David Brandfonbrener
Eryk Helenowski
Yun He
Mrinal Kumar
Han Fang
Yuejie Chi
Karthik Abinav Sankararaman
LRM
96
0
0
01 Oct 2025
Training Agents Inside of Scalable World Models
Danijar Hafner
Wilson Yan
Timothy Lillicrap
VGen
111
13
0
29 Sep 2025
MORPH: PDE Foundation Models with Arbitrary Data Modality
Mahindra Singh Rautela
Alexander Buschmann Most
Siddharth Mansingh
Bradley Love
Ayan Biswas
Diane Oyen
Earl Lawrence
AI4TS
AI4CE
214
0
0
25 Sep 2025
Towards Self-Supervised Foundation Models for Critical Care Time Series
Katja Naasunnguaq Jagd
Rachael DeVries
Ole Winther
AI4TS
85
0
0
24 Sep 2025
Learning spatially structured open quantum dynamics with regional-attention transformers
Dounan Du
Eden Figueroa
AI4CE
36
0
0
08 Sep 2025
FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator
Huynh Tong Dang Khoa
Dang Hoai Nam
Vo Nguyen Le Duy
56
0
0
28 Aug 2025
Bi-Axial Transformers: Addressing the Increasing Complexity of EHR Classification
Rachael DeVries
Casper Christensen
Marie Lisandra Zepeda Mendoza
Ole Winther
64
1
0
17 Aug 2025
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Kai Li
Guo Chen
Wendi Sang
Yi Luo
Zhuo Chen
...
Shulin He
Zhong-Qiu Wang
Andong Li
Z. Wu
Xiaolin Hu
AI4TS
88
4
0
14 Aug 2025
Bubbleformer: Forecasting Boiling with Transformers
Sheikh Md Shakeel Hassan
Xianwei Zou
A. Dhruv
Vishwanath Ganesan
Aparna Chandramowlishwaran
AI4CE
270
1
0
28 Jul 2025
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation
François Rozet
Ruben Ohana
Michael McCabe
Gilles Louppe
F. Lanusse
S. Ho
DiffM
151
7
0
03 Jul 2025
A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis
Hui Wei
Dong Yoon Lee
Shubham Rohal
Zhizhang Hu
Ryan Rossi
Shiwei Fang
Shijia Pan
229
3
0
13 Jun 2025
Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models
Srinivasan Kidambi
Pravin Nair
99
0
0
10 Jun 2025
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hanzhi Zhang
Heng Fan
Kewei Sha
Yan Huang
Yunhe Feng
150
2
0
06 Jun 2025
Transformers Are Universally Consistent
Sagar Ghosh
Kushal Bose
Swagatam Das
112
0
0
30 May 2025
PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations
Benjamin Holzschuh
Qiang Liu
Georg Kohl
Nils Thuerey
AI4CE
216
5
0
30 May 2025
Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing
Zijie Qiu
Jiaqi Wei
Xiang Zhang
Sheng Xu
Kai Zou
Zhi Jin
Zhiqiang Gao
Nanqing Dong
S. Sun
BDL
276
5
0
23 May 2025
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis
Spyridon Samothrakis
297
0
0
14 May 2025
Axial-UNet: A Neural Weather Model for Precipitation Nowcasting
Maitreya Sonawane
Sumit Mamtani
308
0
0
28 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
236
0
0
23 Apr 2025
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer
International Conference on Learning Representations (ICLR), 2025
Yang Liu
Zinan Zheng
Jiashun Cheng
Fugee Tsung
Deli Zhao
Yu Rong
Jiajun Li
223
11
0
27 Feb 2025
Protein Large Language Models: A Comprehensive Survey
Yijia Xiao
Wanjia Zhao
Junkai Zhang
Yiqiao Jin
Han Zhang
...
Xiao Luo
Yu Zhang
James Zou
Yizhou Sun
Wei Wang
LM&MA
AI4CE
368
15
0
21 Feb 2025
Universal Lesion Segmentation Challenge 2023: A Comparative Research of Different Algorithms
Kaiwen Shi
Yifei Li
Binh Ho
Jovian Wang
Kobe Guo
OOD
94
0
0
14 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
285
0
0
11 Feb 2025
LSU-Net: Lightweight Automatic Organs Segmentation Network For Medical Images
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yujie Ding
Shenghua Teng
Zuoyong Li
Xiao Chen
SSeg
177
0
0
27 Jan 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention
International Conference on Learning Representations (ICLR), 2025
Qiuhao Zeng
Jerry Huang
Peng Lu
Gezheng Xu
Boxing Chen
Charles Ling
Boyu Wang
496
5
0
24 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Computer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
750
3
0
21 Jan 2025
VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction
International Conference on Computational Linguistics (COLING), 2024
Khai Phan Tran
Wen Hua
Xue Li
SyDa
278
0
0
18 Dec 2024
Community Research Earth Digital Intelligence Twin (CREDIT)
npj Climate and Atmospheric Science (npj Clim. Atmos. Sci.), 2024
John S. Schreck
Yingkai Sha
William E. Chapman
Dhamma Kimpara
Judith Berner
Seth McGinnis
Arnold Kazadi
Negin Sobhani
Ben Kirk
David John Gagne II
AI4Cl
198
1
0
09 Nov 2024
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng
Jiaming Wang
Jun Xu
MedIm
211
1
0
03 Nov 2024
Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization
International Conference on Learning Representations (ICLR), 2024
Zichen Wang
Yaokun Ji
Jianing Tian
Shuangjia Zheng
DiffM
612
4
0
19 Oct 2024
Learning to refine domain knowledge for biological network inference
Peiwen Li
Menghua Wu
CML
169
2
0
18 Oct 2024
Metalic: Meta-Learning In-Context with Protein Language Models
International Conference on Learning Representations (ICLR), 2024
Jacob Beck
Shikha Surana
Manus McAuliffe
Oliver Bent
Thomas D. Barrett
Juan Jose Garau Luis
Paul Duckworth
AI4CE
294
3
0
10 Oct 2024
System-Level Safety Monitoring and Recovery for Perception Failures in Autonomous Vehicles
IEEE International Conference on Robotics and Automation (ICRA), 2024
Kaustav Chakraborty
Zeyuan Feng
Sushant Veer
Apoorva Sharma
Boris Ivanovic
Marco Pavone
Somil Bansal
214
5
0
26 Sep 2024
GASA-UNet: Global Axial Self-Attention U-Net for 3D Medical Image Segmentation
Chengkun Sun
Russell Stevens Terry
Jiang Bian
Jie Xu
3DPC
174
2
0
20 Sep 2024
PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics
Yuxuan Liu
Jingmin Sun
Xinjie He
Griffin Pinney
Zecheng Zhang
Hayden Schaeffer
AI4CE
183
18
0
15 Sep 2024
Macformer: Transformer with Random Maclaurin Feature Attention
Yuhan Guo
Lizhong Ding
Ye Yuan
Guoren Wang
186
0
0
21 Aug 2024
Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery
Neural Information Processing Systems (NeurIPS), 2024
Yue Yu
Ning Liu
Fei Lu
Tian Gao
S. Jafarzadeh
Stewart Silling
AI4CE
217
21
0
14 Aug 2024
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
ViT
325
22
0
29 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
208
0
0
26 Jul 2024