ResearchTrend.AI


arXiv:2305.00104
MMViT: Multiscale Multiview Vision Transformers


28 April 2023
Yuchen Liu
Natasha Ong
Kaiyan Peng
Bo Xiong
Qifan Wang
Rui Hou
Madian Khabsa
Kaiyue Yang
David C. Liu
Donald Williamson
Hanchao Yu
    ViT

Papers citing "MMViT: Multiscale Multiview Vision Transformers"

5 / 5 papers shown
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
29
1
0
16 Jan 2024
ODEFormer: Symbolic Regression of Dynamical Systems with Transformers
Stéphane d’Ascoli
Soren Becker
Alexander Mathis
Philippe Schwaller
Niki Kilbertus
24
21
0
09 Oct 2023
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
121
264
0
02 Feb 2022
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
104
144
0
02 Feb 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
296
39,198
0
01 Sep 2014