Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.02228
Cited By
Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection
7 August 2018
Yu-Hsuan Wang
Hung-yi Lee
Lin-Shan Lee
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection"
19 / 19 papers shown
Title
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Simon Malan
Benjamin van Niekerk
Herman Kamper
30
0
0
22 Sep 2024
Visually grounded few-shot word acquisition with fewer shots
Leanne Nortje
Benjamin van Niekerk
Herman Kamper
28
1
0
25 May 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
Puyuan Peng
Shang-Wen Li
Okko Rasanen
Abdel-rahman Mohamed
David Harwath
SSL
VLM
33
7
0
19 May 2023
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
Jian Zhu
Zuoyu Tian
Yadong Liu
Cong Zhang
Chia-wen Lo
SSL
32
2
0
23 Oct 2022
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings
J. Svec
L. Smídl
J. Psutka
A. Pražák
13
6
0
21 Oct 2022
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer
J. Svec
Jan Lehecka
L. Smídl
AI4TS
15
3
0
21 Oct 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
350
0
21 May 2022
Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
N. J. Wang
Zongfeng Quan
Shaojun Wang
Jing Xiao
23
1
0
08 Apr 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
34
25
0
24 Feb 2022
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Multilingual acoustic word embedding models for processing zero-resource languages
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
31
24
0
06 Feb 2020
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
Alexander H. Liu
Tao Tu
Hung-yi Lee
Lin-Shan Lee
SSL
35
50
0
28 Oct 2019
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang
Chi-Liang Liu
Hung-yi Lee
Lin-shan Lee
AuLLM
24
39
0
25 Oct 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings
Yi-Chen Chen
Sung-Feng Huang
Hung-yi Lee
Lin-Shan Lee
SSL
14
0
0
10 Apr 2019
Distillation Strategies for Proximal Policy Optimization
Sam Green
C. Vineyard
Ç. Koç
24
8
0
23 Jan 2019
Learning acoustic word embeddings with phonetically associated triplet network
Hyungjun Lim
Yu Li
Yongdong Zhang
Junbo Guo
Jintao Li
6
5
0
07 Nov 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,327
0
05 Nov 2016
1