Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.01949
Cited By
Semantic speech retrieval with a visually grounded model of untranscribed speech
5 October 2017
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semantic speech retrieval with a visually grounded model of untranscribed speech"
6 / 6 papers shown
Title
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
C. Jacobs
Herman Kamper
30
1
0
05 Jul 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
Puyuan Peng
Shang-Wen Li
Okko Rasanen
Abdel-rahman Mohamed
David F. Harwath
SSL
VLM
26
7
0
19 May 2023
Towards visually prompted keyword localisation for zero-resource spoken languages
Leanne Nortje
Herman Kamper
11
6
0
12 Oct 2022
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
Dan Oneaţă
H. Cucu
11
19
0
27 Apr 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
19
7
0
02 Feb 2022
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko
Angie Boggust
David F. Harwath
Brian Chen
D. Joshi
...
Rogerio Feris
Brian Kingsbury
M. Picheny
Antonio Torralba
James R. Glass
SSL
22
141
0
16 Jun 2020
1