Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection

7 August 2018

Papers citing "Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection"

19 / 19 papers shown

Title
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming Simon Malan Benjamin van Niekerk Herman Kamper 30 0 0 22 Sep 2024
Visually grounded few-shot word acquisition with fewer shots Leanne Nortje Benjamin van Niekerk Herman Kamper 28 1 0 25 May 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model Puyuan Peng Shang-Wen Li Okko Rasanen Abdel-rahman Mohamed David Harwath SSL VLM 33 7 0 19 May 2023
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings Jian Zhu Zuoyu Tian Yadong Liu Cong Zhang Chia-wen Lo SSL 32 2 0 23 Oct 2022
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings J. Svec L. Smídl J. Psutka A. Pražák 13 6 0 21 Oct 2022
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer J. Svec Jan Lehecka L. Smídl AI4TS 15 3 0 21 Oct 2022
Self-Supervised Speech Representation Learning: A Review Abdel-rahman Mohamed Hung-yi Lee Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin ... Shang-Wen Li Karen Livescu Lars Maaløe Tara N. Sainath Shinji Watanabe SSL AI4TS 137 350 0 21 May 2022
Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition N. J. Wang Zongfeng Quan Shaojun Wang Jing Xiao 23 1 0 08 Apr 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring Herman Kamper 34 25 0 24 Feb 2022
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language C. Jacobs Herman Kamper 35 10 0 24 Jun 2021
Unsupervised Automatic Speech Recognition: A Review Hanan Aldarmaki Asad Ullah Nazar Zaki VLM SSL 39 57 0 09 Jun 2021
Multilingual acoustic word embedding models for processing zero-resource languages Herman Kamper Yevgen Matusevych Sharon Goldwater 31 24 0 06 Feb 2020
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning Alexander H. Liu Tao Tu Hung-yi Lee Lin-Shan Lee SSL 35 50 0 28 Oct 2019
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering Yung-Sung Chuang Chi-Liang Liu Hung-yi Lee Lin-shan Lee AuLLM 24 39 0 25 Oct 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings Yi-Chen Chen Sung-Feng Huang Hung-yi Lee Lin-Shan Lee SSL 14 0 0 10 Apr 2019
Distillation Strategies for Proximal Policy Optimization Sam Green C. Vineyard Ç. Koç 24 8 0 23 Jan 2019
Learning acoustic word embeddings with phonetically associated triplet network Hyungjun Lim Yu Li Yongdong Zhang Junbo Guo Jintao Li 6 5 0 07 Nov 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval Yi-Chen Chen Sung-Feng Huang Chia-Hao Shen Hung-yi Lee Lin-Shan Lee 46 37 0 21 Jul 2018
Neural Architecture Search with Reinforcement Learning Barret Zoph Quoc V. Le 271 5,327 0 05 Nov 2016