Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.17113
Cited By
v1
v2 (latest)
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
Interspeech (Interspeech), 2022
31 March 2022
Junyi Ao
Zi-Hua Zhang
Long Zhou
Shujie Liu
Haizhou Li
Tom Ko
Lirong Dai
Jinyu Li
Yao Qian
Furu Wei
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1357★)
Papers citing
"Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data"
15 / 15 papers shown
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
Neural Information Processing Systems (NeurIPS), 2024
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
Maja Pantic
SSL
431
17
0
04 Nov 2024
JOOCI: a Framework for Learning Comprehensive Speech Representations
Hemant Yadav
R. Shah
Sunayana Sitaram
404
0
0
14 Oct 2024
Compact Speech Translation Models via Discrete Speech Units Pretraining
Tsz Kin Lam
Alexandra Birch
Barry Haddow
390
3
0
29 Feb 2024
Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model
Automatic Speech Recognition & Understanding (ASRU), 2023
Kai-Wei Chang
Ming-Hsin Chen
Yun-Ping Lin
Jing Neng Hsu
Paul Kuo-Ming Huang
Chien-yu Huang
Shang-Wen Li
Hung-yi Lee
398
7
0
04 Oct 2023
Decoupled Structure for Improved Adaptability of End-to-End Models
Speech Communication (Speech Commun.), 2023
Keqi Deng
P. Woodland
AuLLM
296
7
0
25 Aug 2023
Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
221
1
0
26 Feb 2023
Pre-training for Speech Translation: CTC Meets Optimal Transport
International Conference on Machine Learning (ICML), 2023
Hang Le
Hongyu Gong
Changhan Wang
J. Pino
Benjamin Lecouteux
D. Schwab
OT
449
32
0
27 Jan 2023
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Interspeech (Interspeech), 2022
Xiaohuan Zhou
Jiaming Wang
Zeyu Cui
Shiliang Zhang
Zhijie Yan
Jingren Zhou
Chang Zhou
284
13
0
29 Nov 2022
Channel-Aware Pretraining of Joint Encoder-Decoder Self-Supervised Model for Telephonic-Speech ASR
Vrunda N. Sukhadia
Anjana Arunkumar
S. Umesh
188
1
0
03 Nov 2022
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jian Zhu
Zuoyu Tian
Yadong Liu
Cong Zhang
Chia-wen Lo
SSL
277
2
0
23 Oct 2022
CTCBERT: Advancing Hidden-unit BERT with CTC Objectives
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ruchao Fan
Yiming Wang
Yashesh Gaur
Jinyu Li
345
8
0
16 Oct 2022
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Interspeech (Interspeech), 2022
Chutong Meng
Junyi Ao
Tom Ko
Mingxuan Wang
Haizhou Li
SSL
313
7
0
08 Oct 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
329
64
0
07 Oct 2022
The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
International Workshop on Spoken Language Translation (IWSLT), 2022
Ziqiang Zhang
Junyi Ao
Long Zhou
Shujie Liu
Furu Wei
Jinyu Li
242
9
0
12 Jun 2022
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Felix Wu
Kwangyoun Kim
Shinji Watanabe
Kyu Jeong Han
Ryan T. McDonald
Kilian Q. Weinberger
Yoav Artzi
SyDa
280
46
0
02 May 2022
1
Page 1 of 1