OmniNet: Omnidirectional Representations from Transformers

International Conference on Machine Learning (ICML), 2021
1 March 2021
Yi Tay
Mostafa Dehghani
V. Aribandi
Jai Gupta
Philip Pham
Zhen Qin
Dara Bahri
Da-Cheng Juan
Donald Metzler

Papers citing "OmniNet: Omnidirectional Representations from Transformers"

19 / 19 papers shown
Anchored Diffusion Language Model
Litu Rout
Constantine Caramanis
Sanjay Shakkottai
362
4
0
24 May 2025
Variational Autoencoding Discrete Diffusion with Enhanced Dimensional Correlations Modeling
Tianyu Xie
Shuchen Xue
Zijin Feng
Tianyang Hu
Jiacheng Sun
Zhenguo Li
Cheng Zhang
DiffM
1.0K
1
0
23 May 2025
Continuous Diffusion Model for Language Modeling
Jaehyeong Jo
Sung Ju Hwang
210
3
0
17 Feb 2025
MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections
Da Xiao
Qingye Meng
Shengping Li
Xingyuan Yuan
MoE, AI4CE
493
9
0
13 Feb 2025
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
Neural Information Processing Systems (NeurIPS), 2024
Lirui Wang
Xinlei Chen
Jialiang Zhao
Kaiming He
249
109
0
30 Sep 2024
CUPID: Improving Battle Fairness and Position Satisfaction in Online MOBA Games with a Re-matchmaking System
Ge Fan
Chaoyun Zhang
Kai Wang
Yingjie Li
Junyang Chen
Zenglin Xu
203
5
0
28 Jun 2024
Simple and Effective Masked Diffusion Language Models
Subham Sekhar Sahoo
Marianne Arriola
Yair Schiff
Aaron Gokaslan
Edgar Marroquin
Justin T Chiu
Alexander M. Rush
Volodymyr Kuleshov
DiffM
258
348
0
11 Jun 2024
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Zhaoyang Zhang
Wenqi Shao
Yixiao Ge
Xiaogang Wang
Liang Feng
Ping Luo
199
5
0
20 Dec 2023
FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Chi-Chih Chang
Yuan-Yao Sung
Shixing Yu
N. Huang
Diana Marculescu
Kai-Chiang Wu
ViT
167
4
0
07 Nov 2023
QuickSkill: Novice Skill Estimation in Online Multiplayer Games
International Conference on Information and Knowledge Management (CIKM), 2022
Chaoyun Zhang
Kai Wang
Hao Chen
Ge Fan
Yingjie Li
Lifang Wu
Bingchao Zheng
124
10
0
15 Aug 2022
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
273
4
0
04 Mar 2022
ViNMT: Neural Machine Translation Toolkit
Nguyen Hoang Quan
N. T. Dat
Nguyen Hoang Minh Cong
Nguyen Van Vinh
Ngo Thi Vinh
N. Thai
T. Viet
315
2
0
31 Dec 2021
Rank4Class: A Ranking Formulation for Multiclass Classification
Nan Wang
Zhen Qin
Le Yan
Honglei Zhuang
Xuanhui Wang
Michael Bendersky
Marc Najork
128
4
0
17 Dec 2021
The Efficiency Misnomer
International Conference on Learning Representations (ICLR), 2021
Mostafa Dehghani
Anurag Arnab
Lucas Beyer
Ashish Vaswani
Yi Tay
275
112
0
25 Oct 2021
SCENIC: A JAX Library for Computer Vision Research and Beyond
Mostafa Dehghani
A. Gritsenko
Anurag Arnab
Matthias Minderer
Yi Tay
202
75
0
18 Oct 2021
Exploring the Limits of Large Scale Pre-training
Samira Abnar
Mostafa Dehghani
Behnam Neyshabur
Hanie Sedghi
AI4CE
208
133
0
05 Oct 2021
Long-Short Transformer: Efficient Transformers for Language and Vision
Chen Zhu
Ming-Yu Liu
Chaowei Xiao
Mohammad Shoeybi
Tom Goldstein
Anima Anandkumar
Bryan Catanzaro
ViT, VLM
435
159
0
05 Jul 2021
KVT: k-NN Attention for Boosting Vision Transformers
European Conference on Computer Vision (ECCV), 2021
Pichao Wang
Qingsong Wen
F. Wang
Ming Lin
Shuning Chang
Hao Li
Rong Jin
ViT
253
129
0
28 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
133
0
0
09 May 2021