v1v2 (latest)

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Interspeech (Interspeech), 2022

31 March 2022

Haizhou Li

ArXiv (abs)PDF HTML Github (1357★)

Papers citing "Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data"

15 / 15 papers shown

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsNeural Information Processing Systems (NeurIPS), 2024

431

04 Nov 2024

JOOCI: a Framework for Learning Comprehensive Speech Representations

Hemant Yadav

R. Shah

Sunayana Sitaram

404

14 Oct 2024

Compact Speech Translation Models via Discrete Speech Units Pretraining

Tsz Kin Lam

Alexandra Birch

Barry Haddow

390

29 Feb 2024

Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech ModelAutomatic Speech Recognition & Understanding (ASRU), 2023

Hung-yi Lee

398

04 Oct 2023

Decoupled Structure for Improved Adaptability of End-to-End ModelsSpeech Communication (Speech Commun.), 2023

Keqi Deng

P. Woodland

AuLLM

296

25 Aug 2023

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Changfeng Gao

Gaofeng Cheng

Pengyuan Zhang

Yonghong Yan

221

26 Feb 2023

Pre-training for Speech Translation: CTC Meets Optimal TransportInternational Conference on Machine Learning (ICML), 2023

449

27 Jan 2023

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech RecognitionInterspeech (Interspeech), 2022

Xiaohuan Zhou

Jiaming Wang

Zeyu Cui

Shiliang Zhang

Zhijie Yan

Jingren Zhou

Chang Zhou

284

29 Nov 2022

Channel-Aware Pretraining of Joint Encoder-Decoder Self-Supervised Model for Telephonic-Speech ASR

Vrunda N. Sukhadia

Anjana Arunkumar

S. Umesh

188

03 Nov 2022

Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

277

23 Oct 2022

CTCBERT: Advancing Hidden-unit BERT with CTC ObjectivesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

345

16 Oct 2022

CoBERT: Self-Supervised Speech Representation Learning Through Code Representation LearningInterspeech (Interspeech), 2022

Haizhou Li

313

08 Oct 2022

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-trainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

329

07 Oct 2022

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared TaskInternational Workshop on Spoken Language Translation (IWSLT), 2022

242

12 Jun 2022

Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo LanguagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Kwangyoun Kim

280

02 May 2022