Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1610.03342
Cited By
From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning
International Conference on Computational Linguistics (COLING), 2016
11 October 2016
Lieke Gelderloos
Grzegorz Chrupała
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning"
22 / 22 papers shown
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Saurabhchand Bhati
Jesús Villalba
Laureano Moro-Velazquez
Thomas Thebaud
Najim Dehak
CLIP
189
4
0
08 Sep 2023
Exploring How Generative Adversarial Networks Learn Phonological Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jing Chen
Micha Elsner
GAN
119
6
0
21 May 2023
Towards visually prompted keyword localisation for zero-resource spoken languages
Spoken Language Technology Workshop (SLT), 2022
Leanne Nortje
Herman Kamper
151
6
0
12 Oct 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Herman Kamper
259
31
0
24 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
193
7
0
02 Feb 2022
Attention-Based Keyword Localisation in Speech using Visual Grounding
Kayode Olaleye
Herman Kamper
109
13
0
16 Jun 2021
Probing artificial neural networks: insights from neuroscience
Anna A. Ivanova
John Hewitt
Noga Zaslavsky
172
18
0
16 Apr 2021
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Interspeech (Interspeech), 2021
Ramon Sanabria
Austin Waters
Jason Baldridge
3DV
192
27
0
05 Apr 2021
Towards localisation of keywords in speech using weak supervision
Kayode Olaleye
Benjamin van Niekerk
Herman Kamper
91
5
0
14 Dec 2020
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
Ankita Pasad
Bowen Shi
Herman Kamper
Karen Livescu
134
12
0
24 Apr 2019
The emergence of number and syntax units in LSTM language models
North American Chapter of the Association for Computational Linguistics (NAACL), 2019
Yair Lakretz
Germán Kruszewski
T. Desbordes
Dieuwke Hupkes
S. Dehaene
Marco Baroni
294
180
0
18 Mar 2019
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
275
593
0
21 Dec 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
241
207
0
04 Apr 2018
Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure
Dieuwke Hupkes
Sara Veldhoen
Willem H. Zuidema
262
295
0
28 Nov 2017
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
163
54
0
05 Oct 2017
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Yonatan Belinkov
James R. Glass
106
91
0
13 Sep 2017
Encoding of phonology in a recurrent neural model of grounded speech
Conference on Computational Natural Language Learning (CoNLL), 2017
Afra Alishahi
Marie Barking
Grzegorz Chrupała
177
60
0
12 Jun 2017
Imagination improves Multimodal Translation
Desmond Elliott
Ákos Kádár
295
146
0
11 May 2017
What do Neural Machine Translation Models Learn about Morphology?
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
385
428
0
11 Apr 2017
Visually grounded learning of keyword prediction from untranscribed speech
Herman Kamper
Shane Settle
Gregory Shakhnarovich
Karen Livescu
251
64
0
23 Mar 2017
Representations of language in a model of visually grounded speech signal
Annual Meeting of the Association for Computational Linguistics (ACL), 2017
Grzegorz Chrupała
Lieke Gelderloos
Afra Alishahi
232
133
0
07 Feb 2017
Learning Word-Like Units from Joint Audio-Visual Analysis
Annual Meeting of the Association for Computational Linguistics (ACL), 2017
David Harwath
James R. Glass
219
107
0
25 Jan 2017
1