Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.15953
Cited By
Masked Audio Modeling with CLAP and Multi-Objective Learning
29 January 2024
Yifei Xin
Xiulian Peng
Yan Lu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Masked Audio Modeling with CLAP and Multi-Objective Learning"
6 / 6 papers shown
Title
Improving Weakly Supervised Sound Event Detection with Causal Intervention
Yifei Xin
Dongchao Yang
Fan Cui
Yujun Wang
Yuexian Zou
CML
35
6
0
10 Mar 2023
SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners
Feng Liang
Yangguang Li
Diana Marculescu
SSL
TPM
ViT
14
15
0
28 May 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
98
181
0
02 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
252
5,353
0
11 Nov 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
250
3,694
0
24 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
80
116
0
02 Feb 2021
1