Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.03497
Cited By
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
7 January 2024
Wenxi Chen
Yuzhe Liang
Ziyang Ma
Zhisheng Zheng
Xie Chen
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EAT: Self-Supervised Pre-Training with Efficient Audio Transformer"
7 / 7 papers shown
Title
Can Masked Autoencoders Also Listen to Birds?
Lukas Rauch
Ilyass Moummad
René Heinrich
Alexis Joly
Bernhard Sick
Christoph Scholz
27
0
0
17 Apr 2025
Exploring Differences between Human Perception and Model Inference in Audio Event Recognition
Yizhou Tan
Yanru Wu
Yuanbo Hou
Xin Xu
Hui Bu
Shengchen Li
Dick Botteldooren
Mark D. Plumbley
15
0
0
10 Sep 2024
AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection
Anbai Jiang
Bing Han
Zhiqiang Lv
Yufeng Deng
Wei-Qiang Zhang
Xie Chen
Yanmin Qian
Jia Liu
Pingyi Fan
27
3
0
17 Jun 2024
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
114
264
0
02 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
295
5,761
0
29 Apr 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
99
144
0
02 Feb 2021
1