From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning

International Conference on Computational Linguistics (COLING), 2016

11 October 2016

Lieke Gelderloos

Grzegorz Chrupała

Papers citing "From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning"

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Saurabhchand Bhati

Jesús Villalba

Laureano Moro-Velazquez

Thomas Thebaud

Najim Dehak

CLIP

189

08 Sep 2023

Exploring How Generative Adversarial Networks Learn Phonological Representations

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Jing Chen

Micha Elsner

GAN

119

21 May 2023

Towards visually prompted keyword localisation for zero-resource spoken languages

Spoken Language Technology Workshop (SLT), 2022

Leanne Nortje

Herman Kamper

151

12 Oct 2022

Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Herman Kamper

259

24 Feb 2022

Keyword localisation in untranscribed speech using visually grounded speech models

IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Kayode Olaleye

Dan Oneaţă

Herman Kamper

193

02 Feb 2022

Attention-Based Keyword Localisation in Speech using Visual Grounding

Kayode Olaleye

Herman Kamper

109

16 Jun 2021

Probing artificial neural networks: insights from neuroscience

Anna A. Ivanova

John Hewitt

Noga Zaslavsky

172

16 Apr 2021

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval

Interspeech (Interspeech), 2021

192

05 Apr 2021

Towards localisation of keywords in speech using weak supervision

Kayode Olaleye

Benjamin van Niekerk

Herman Kamper

14 Dec 2020

On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval

134

24 Apr 2019

The emergence of number and syntax units in LSTM language models

North American Chapter of the Association for Computational Linguistics (NAACL), 2019

294

180

18 Mar 2019

Analysis Methods in Neural Language Processing: A Survey

Yonatan Belinkov

James R. Glass

275

593

21 Dec 2018

Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input

Antonio Torralba

241

207

04 Apr 2018

Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure

Dieuwke Hupkes

Sara Veldhoen

Willem H. Zuidema

262

295

28 Nov 2017

Semantic speech retrieval with a visually grounded model of untranscribed speech

Herman Kamper

Gregory Shakhnarovich

Karen Livescu

163

05 Oct 2017

Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems

Yonatan Belinkov

James R. Glass

106

13 Sep 2017

Encoding of phonology in a recurrent neural model of grounded speech

Conference on Computational Natural Language Learning (CoNLL), 2017

Afra Alishahi

Marie Barking

Grzegorz Chrupała

177

12 Jun 2017

Imagination improves Multimodal Translation

Desmond Elliott

Ákos Kádár

295

146

11 May 2017

What do Neural Machine Translation Models Learn about Morphology?

385

428

11 Apr 2017

Visually grounded learning of keyword prediction from untranscribed speech

Herman Kamper

Shane Settle

Gregory Shakhnarovich

Karen Livescu

251

23 Mar 2017

Representations of language in a model of visually grounded speech signal

Annual Meeting of the Association for Computational Linguistics (ACL), 2017

Grzegorz Chrupała

Lieke Gelderloos

Afra Alishahi

232

133

07 Feb 2017

Learning Word-Like Units from Joint Audio-Visual Analysis

Annual Meeting of the Association for Computational Linguistics (ACL), 2017

David Harwath

James R. Glass

219

107

25 Jan 2017