Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.18058
Cited By
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
25 July 2024
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
Re-assign community
ArXiv
PDF
HTML
Papers citing
"I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition"
3 / 3 papers shown
Title
What does a platypus look like? Generating customized prompts for zero-shot image classification
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
116
211
0
07 Sep 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
111
262
0
02 Feb 2022
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013
1