ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.03342
  4. Cited By
From phonemes to images: levels of representation in a recurrent neural
  model of visually-grounded language learning

From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning

International Conference on Computational Linguistics (COLING), 2016
11 October 2016
Lieke Gelderloos
Grzegorz Chrupała
ArXiv (abs)PDFHTML

Papers citing "From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning"

22 / 22 papers shown
Leveraging Pretrained Image-text Models for Improving Audio-Visual
  Learning
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Saurabhchand Bhati
Jesús Villalba
Laureano Moro-Velazquez
Thomas Thebaud
Najim Dehak
CLIP
189
4
0
08 Sep 2023
Exploring How Generative Adversarial Networks Learn Phonological
  Representations
Exploring How Generative Adversarial Networks Learn Phonological RepresentationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Jing Chen
Micha Elsner
GAN
119
6
0
21 May 2023
Towards visually prompted keyword localisation for zero-resource spoken
  languages
Towards visually prompted keyword localisation for zero-resource spoken languagesSpoken Language Technology Workshop (SLT), 2022
Leanne Nortje
Herman Kamper
151
6
0
12 Oct 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and
  Self-Supervised Scoring
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised ScoringIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Herman Kamper
259
31
0
24 Feb 2022
Keyword localisation in untranscribed speech using visually grounded
  speech models
Keyword localisation in untranscribed speech using visually grounded speech modelsIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
193
7
0
02 Feb 2022
Attention-Based Keyword Localisation in Speech using Visual Grounding
Attention-Based Keyword Localisation in Speech using Visual Grounding
Kayode Olaleye
Herman Kamper
109
13
0
16 Jun 2021
Probing artificial neural networks: insights from neuroscience
Probing artificial neural networks: insights from neuroscience
Anna A. Ivanova
John Hewitt
Noga Zaslavsky
172
18
0
16 Apr 2021
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Talk, Don't Write: A Study of Direct Speech-Based Image RetrievalInterspeech (Interspeech), 2021
Ramon Sanabria
Austin Waters
Jason Baldridge
3DV
192
27
0
05 Apr 2021
Towards localisation of keywords in speech using weak supervision
Towards localisation of keywords in speech using weak supervision
Kayode Olaleye
Benjamin van Niekerk
Herman Kamper
91
5
0
14 Dec 2020
On the Contributions of Visual and Textual Supervision in Low-Resource
  Semantic Speech Retrieval
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
Ankita Pasad
Bowen Shi
Herman Kamper
Karen Livescu
134
12
0
24 Apr 2019
The emergence of number and syntax units in LSTM language models
The emergence of number and syntax units in LSTM language modelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2019
Yair Lakretz
Germán Kruszewski
T. Desbordes
Dieuwke Hupkes
S. Dehaene
Marco Baroni
294
180
0
18 Mar 2019
Analysis Methods in Neural Language Processing: A Survey
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
275
593
0
21 Dec 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory
  Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
241
207
0
04 Apr 2018
Visualisation and 'diagnostic classifiers' reveal how recurrent and
  recursive neural networks process hierarchical structure
Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure
Dieuwke Hupkes
Sara Veldhoen
Willem H. Zuidema
262
295
0
28 Nov 2017
Semantic speech retrieval with a visually grounded model of
  untranscribed speech
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
163
54
0
05 Oct 2017
Analyzing Hidden Representations in End-to-End Automatic Speech
  Recognition Systems
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Yonatan Belinkov
James R. Glass
106
91
0
13 Sep 2017
Encoding of phonology in a recurrent neural model of grounded speech
Encoding of phonology in a recurrent neural model of grounded speechConference on Computational Natural Language Learning (CoNLL), 2017
Afra Alishahi
Marie Barking
Grzegorz Chrupała
177
60
0
12 Jun 2017
Imagination improves Multimodal Translation
Imagination improves Multimodal Translation
Desmond Elliott
Ákos Kádár
295
146
0
11 May 2017
What do Neural Machine Translation Models Learn about Morphology?
What do Neural Machine Translation Models Learn about Morphology?
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
385
428
0
11 Apr 2017
Visually grounded learning of keyword prediction from untranscribed
  speech
Visually grounded learning of keyword prediction from untranscribed speech
Herman Kamper
Shane Settle
Gregory Shakhnarovich
Karen Livescu
251
64
0
23 Mar 2017
Representations of language in a model of visually grounded speech
  signal
Representations of language in a model of visually grounded speech signalAnnual Meeting of the Association for Computational Linguistics (ACL), 2017
Grzegorz Chrupała
Lieke Gelderloos
Afra Alishahi
232
133
0
07 Feb 2017
Learning Word-Like Units from Joint Audio-Visual Analysis
Learning Word-Like Units from Joint Audio-Visual AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2017
David Harwath
James R. Glass
219
107
0
25 Jan 2017
1