2303.17517
Cited By
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
30 March 2023
H. Ryu
Arda Senocak
In So Kweon
Joon Son Chung
VLM
Papers citing
"Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples"
6 / 6 papers shown
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Jongbhin Woo
H. Ryu
Youngjoon Jang
Jae-Won Cho
Joon Son Chung
21
0
0
17 Oct 2024
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment
Arda Senocak
H. Ryu
Junsik Kim
Tae-Hyun Oh
Hanspeter Pfister
Joon Son Chung
24
3
0
18 Jul 2024
Sound Source Localization is All about Cross-Modal Alignment
Arda Senocak
H. Ryu
Junsik Kim
Tae-Hyun Oh
Hanspeter Pfister
Joon Son Chung
19
18
0
19 Sep 2023
Cascaded Multilingual Audio-Visual Learning from Videos
Andrew Rouditchenko
Angie Boggust
David F. Harwath
Samuel Thomas
Hilde Kuehne
...
Rameswar Panda
Rogerio Feris
Brian Kingsbury
M. Picheny
James R. Glass
47
8
0
08 Nov 2021
With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Debidatta Dwibedi
Y. Aytar
Jonathan Tompson
P. Sermanet
Andrew Zisserman
SSL
183
450
0
29 Apr 2021
Self-supervised Co-training for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
204
304
0
19 Oct 2020