Cited By: arXiv 1902.03052

Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese
8 February 2019. William N. Havard, Jean-Pierre Chevrot, Laurent Besacier
Papers citing "Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese" (13 of 13 papers shown)
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
Interspeech, 2023. Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdel-rahman Mohamed, David Harwath. [SSL, VLM] 335 · 11 · 0 · 19 May 2023
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022. Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David Harwath. [VLM] 256 · 11 · 0 · 02 Nov 2022
Self-Supervised Speech Representation Learning: A Review
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022. Abdel-rahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob Drachmann Havtorn, Joakim Edin, ..., Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe. [SSL, AI4TS] 796 · 475 · 0 · 21 May 2022
Learning English with Peppa Pig
Transactions of the Association for Computational Linguistics (TACL), 2022. Mitja Nikolaus, Afra Alishahi, Grzegorz Chrupała. 262 · 16 · 0 · 25 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022. Kayode Olaleye, Dan Oneaţă, Herman Kamper. 260 · 7 · 0 · 02 Feb 2022
Cascaded Multilingual Audio-Visual Learning from Videos
Andrew Rouditchenko, Angie Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, ..., Yikang Shen, Rogerio Feris, Brian Kingsbury, M. Picheny, James R. Glass. 618 · 8 · 0 · 08 Nov 2021
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset
Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass. 179 · 4 · 0 · 14 Oct 2021
Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? -- A computational investigation
Khazar Khorrami, Okko Räsänen. 298 · 24 · 0 · 29 Sep 2021
ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track, 2021 edition
Afra Alishahi, Grzegorz Chrupała, Alejandrina Cristià, Emmanuel Dupoux, Bertrand Higy, Marvin Lavechin, Okko Räsänen, Chen Yu. 222 · 7 · 0 · 14 Jul 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020. Wei-Ning Hsu, David Harwath, Christopher Song, James R. Glass. [CLIP] 227 · 74 · 0 · 31 Dec 2020
Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech
William N. Havard, Jean-Pierre Chevrot, Laurent Besacier. 242 · 11 · 0 · 15 Jun 2020
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
International Conference on Learning Representations (ICLR), 2019. David Harwath, Wei-Ning Hsu, James R. Glass. 284 · 88 · 0 · 21 Nov 2019
Word Recognition, Competition, and Activation in a Model of Visually Grounded Speech
Conference on Computational Natural Language Learning (CoNLL), 2019. William N. Havard, Jean-Pierre Chevrot, Laurent Besacier. 153 · 23 · 0 · 18 Sep 2019