ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.02228
  4. Cited By
Segmental Audio Word2Vec: Representing Utterances as Sequences of
  Vectors with Applications in Spoken Term Detection

Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection

7 August 2018
Yu-Hsuan Wang
Hung-yi Lee
Lin-Shan Lee
ArXivPDFHTML

Papers citing "Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection"

19 / 19 papers shown
Title
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Simon Malan
Benjamin van Niekerk
Herman Kamper
30
0
0
22 Sep 2024
Visually grounded few-shot word acquisition with fewer shots
Visually grounded few-shot word acquisition with fewer shots
Leanne Nortje
Benjamin van Niekerk
Herman Kamper
28
1
0
25 May 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually
  Grounded, Self-Supervised Speech Model
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
Puyuan Peng
Shang-Wen Li
Okko Rasanen
Abdel-rahman Mohamed
David Harwath
SSL
VLM
33
7
0
19 May 2023
Bootstrapping meaning through listening: Unsupervised learning of spoken
  sentence embeddings
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
Jian Zhu
Zuoyu Tian
Yadong Liu
Cong Zhang
Chia-wen Lo
SSL
32
2
0
23 Oct 2022
Spoken Term Detection and Relevance Score Estimation using Dot-Product
  of Pronunciation Embeddings
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings
J. Svec
L. Smídl
J. Psutka
A. Pražák
13
6
0
21 Oct 2022
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer
J. Svec
Jan Lehecka
L. Smídl
AI4TS
15
3
0
21 Oct 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
350
0
21 May 2022
Adding Connectionist Temporal Summarization into Conformer to Improve
  Its Decoder Efficiency For Speech Recognition
Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
N. J. Wang
Zongfeng Quan
Shaojun Wang
Jing Xiao
23
1
0
08 Apr 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and
  Self-Supervised Scoring
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
34
25
0
24 Feb 2022
Multilingual transfer of acoustic word embeddings improves when training
  on languages related to the target zero-resource language
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Multilingual acoustic word embedding models for processing zero-resource
  languages
Multilingual acoustic word embedding models for processing zero-resource languages
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
31
24
0
06 Feb 2020
Towards Unsupervised Speech Recognition and Synthesis with Quantized
  Speech Representation Learning
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
Alexander H. Liu
Tao Tu
Hung-yi Lee
Lin-Shan Lee
SSL
35
50
0
28 Oct 2019
SpeechBERT: An Audio-and-text Jointly Learned Language Model for
  End-to-end Spoken Question Answering
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang
Chi-Liang Liu
Hung-yi Lee
Lin-shan Lee
AuLLM
24
39
0
25 Oct 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with
  Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text
  Embeddings
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings
Yi-Chen Chen
Sung-Feng Huang
Hung-yi Lee
Lin-Shan Lee
SSL
14
0
0
10 Apr 2019
Distillation Strategies for Proximal Policy Optimization
Distillation Strategies for Proximal Policy Optimization
Sam Green
C. Vineyard
Ç. Koç
24
8
0
23 Jan 2019
Learning acoustic word embeddings with phonetically associated triplet
  network
Learning acoustic word embeddings with phonetically associated triplet network
Hyungjun Lim
Yu Li
Yongdong Zhang
Junbo Guo
Jintao Li
6
5
0
07 Nov 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in
  Spoken Content Retrieval
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Neural Architecture Search with Reinforcement Learning
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,327
0
05 Nov 2016
1