ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.06258
  4. Cited By
Unsupervised vs. transfer learning for multimodal one-shot matching of
  speech and images

Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images

14 August 2020
Leanne Nortje
Herman Kamper
    SSL
ArXiv (abs)PDFHTML

Papers citing "Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images"

8 / 8 papers shown
Visually grounded few-shot word learning in low-resource settings
Visually grounded few-shot word learning in low-resource settingsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Leanne Nortje
Dan Oneaţă
Herman Kamper
VLM
216
4
0
20 Jun 2023
Visually grounded few-shot word acquisition with fewer shots
Visually grounded few-shot word acquisition with fewer shotsInterspeech (Interspeech), 2023
Leanne Nortje
Benjamin van Niekerk
Herman Kamper
157
1
0
25 May 2023
Towards visually prompted keyword localisation for zero-resource spoken
  languages
Towards visually prompted keyword localisation for zero-resource spoken languagesSpoken Language Technology Workshop (SLT), 2022
Leanne Nortje
Herman Kamper
157
6
0
12 Oct 2022
YFACC: A Yorùbá speech-image dataset for cross-lingual keyword
  localisation through visual grounding
YFACC: A Yorùbá speech-image dataset for cross-lingual keyword localisation through visual groundingSpoken Language Technology Workshop (SLT), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
ObjD
226
8
0
10 Oct 2022
Keyword localisation in untranscribed speech using visually grounded
  speech models
Keyword localisation in untranscribed speech using visually grounded speech modelsIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
206
7
0
02 Feb 2022
Attention-Based Keyword Localisation in Speech using Visual Grounding
Attention-Based Keyword Localisation in Speech using Visual Grounding
Kayode Olaleye
Herman Kamper
129
13
0
16 Jun 2021
A Multiple Classifier Approach for Concatenate-Designed Neural Networks
A Multiple Classifier Approach for Concatenate-Designed Neural Networks
Ka‐Hou Chan
S. Im
Wei Ke
132
23
0
14 Jan 2021
Direct multimodal few-shot learning of speech and images
Direct multimodal few-shot learning of speech and imagesInterspeech (Interspeech), 2020
Leanne Nortje
Herman Kamper
SSL
316
10
0
10 Dec 2020
1
Page 1 of 1