2308.09089
Cited By
Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
17 August 2023
J. Wilkins
Justin Salamon
Magdalena Fuentes
J. P. Bello
Oriol Nieto
CLIP
Papers citing
"Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries"
8 / 8 papers shown
Title
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu
Christos Tsirigotis
Ke Chen
Cheng-Zhi Anna Huang
Aaron C. Courville
Oriol Nieto
Prem Seetharaman
Justin Salamon
43
0
0
08 May 2025
Audio-Language Datasets of Scenes and Events: A Survey
Gijs Wijngaard
Elia Formisano
Michele Esposito
M. Dumontier
79
2
0
10 Jan 2025
Learning Self-Supervised Audio-Visual Representations for Sound Recommendations
Sudha Krishnamurthy
SSL
73
1
0
10 Dec 2024
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
Jingyu Liu
Minquan Wang
Ye Ma
Bo Wang
Aozhu Chen
Quan Chen
Peng Jiang
Xirong Li
36
1
0
23 Aug 2024
Enhancing Audio Generation Diversity with Visual Information
Zeyu Xie
Baihan Li
Xuenan Xu
Mengyue Wu
Kai Yu
19
1
0
02 Mar 2024
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
Adrian S. Roman
Baladithya Balamurugan
Rithik Pothuganti
17
5
0
29 Jan 2024
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
221
1,017
0
13 Oct 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
322
2,249
0
02 Sep 2021