2305.00104
MMViT: Multiscale Multiview Vision Transformers
28 April 2023
Yuchen Liu
Natasha Ong
Kaiyan Peng
Bo Xiong
Qifan Wang
Rui Hou
Madian Khabsa
Kaiyue Yang
David C. Liu
Donald Williamson
Hanchao Yu
ViT
Papers citing
"MMViT: Multiscale Multiview Vision Transformers"
5 / 5 papers shown
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
29
1
0
16 Jan 2024
ODEFormer: Symbolic Regression of Dynamical Systems with Transformers
Stéphane d’Ascoli
Soren Becker
Alexander Mathis
Philippe Schwaller
Niki Kilbertus
24
21
0
09 Oct 2023
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
118
264
0
02 Feb 2022
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
104
144
0
02 Feb 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
296
39,198
0
01 Sep 2014