v1v2v3 (latest)

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

International Conference on Language Resources and Evaluation (LREC), 2022

11 January 2022

Yeting Jia

Michelle Tadmor Ramanovich

Papers citing "CVSS Corpus and Massively Multilingual Speech-to-Speech Translation"

50 / 54 papers shown

Improving Direct Persian-English Speech-to-Speech Translation with Discrete Units and Synthetic Parallel DataPhysical Review X (PRX), 2025

Sina Rashidi

Hossein Sameti

114

16 Nov 2025

MTP-S2UT: Enhancing Speech-to-Speech Translation Quality with Multi-token Prediction

144

11 Oct 2025

UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice

193

25 Sep 2025

MLLM-based Speech Recognition: When and How is Multimodality Beneficial?

274

25 Jul 2025

Step-Audio 2 Technical Report

...

352

22 Jul 2025

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs

287

12 Jun 2025

Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration

1.1K

07 May 2025

Universal Speech Token Learning via Low-Bitrate Neural Codec and Pretrained RepresentationsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2024

415

15 Mar 2025

Audio-FLAN: A Preliminary Release

...

325

23 Feb 2025

High-Fidelity Simultaneous Speech-To-Speech Translation

1.1K

05 Feb 2025

Recent Advances in Speech Language Models: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

717

01 Oct 2024

Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?

Yiwen Guan

V. Trinh

Vivek Voleti

Jacob Whitehill

348

13 Sep 2024

Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation

J. Duret

Yannick Esteve

Titouan Parcollet

241

08 Jul 2024

MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research

Ke Ding

Guanglu Wan

250

26 Jun 2024

Diffusion Synthesizer for Efficient Multilingual Speech to Speech TranslationInterspeech (Interspeech), 2024

Nameer Hirschkind

Xiao Yu

Joseph Liu

Eloi DuBois

...

189

14 Jun 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation

...

376

28 May 2024

CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning

Medha Hira

Arnav Goel

Anubha Gupta

298

23 May 2024

DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

Weiting Tan

Jingyu Zhang

Lingfeng Shen

Daniel Khashabi

Philipp Koehn

286

22 May 2024

FFSTC: Fongbe to French Speech Translation CorpusInternational Conference on Language Resources and Evaluation (LREC), 2024

D. F. Kponou

F. Laleye

E. C. Ezin

243

08 Mar 2024

Direct Punjabi to English speech translation using discrete units

Prabhjot Kaur

L. A. M. Bush

Weisong Shi

253

25 Feb 2024

TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Seung-Bin Kim

Sang-Hoon Lee

Seong-Whan Lee

214

17 Jan 2024

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech TranslationNeural Information Processing Systems (NeurIPS), 2023

Qingkai Fang

Yan Zhou

Yangzhou Feng

254

11 Oct 2023

Speech-to-Speech Translation with Discrete-Unit-Based Style TransferAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Rongjie Huang

Zhou Zhao

202

14 Sep 2023

Direct Text to Speech Translation System using Acoustic UnitsIEEE Signal Processing Letters (IEEE SPL), 2023

184

14 Sep 2023

Sparks of Large Audio Models: A Survey and Outlook

...

Björn W. Schuller

821

24 Aug 2023

Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit TranslationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

265

03 Aug 2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages

236

17 Jul 2023

Towards cross-language prosody transfer for dialogInterspeech (Interspeech), 2023

Jonathan Avila

Nigel G. Ward

317

09 Jul 2023

AudioPaLM: A Large Language Model That Can Speak and Listen

Paul Kishan Rubenstein

Chulayuth Asawaroengchai

...

418

425

22 Jun 2023

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech TranslationInterspeech (Interspeech), 2023

Sanjeev Khudanpur

264

20 Jun 2023

PolyVoice: Language Models for Speech to Speech TranslationInternational Conference on Learning Representations (ICLR), 2023

...

Yuxuan Wang

339

05 Jun 2023

Translatotron 3: Speech to Speech Translation with Monolingual DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Eliya Nachmani

Alon Levkovitch

Yi-Yang Ding

Chulayutsh Asawaroengchai

Heiga Zen

Michelle Tadmor Ramanovich

382

27 May 2023

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Rongjie Huang

Xize Cheng

...

Zhou Zhao

257

24 May 2023

i-Code Studio: A Configurable and Composable Framework for Integrative AIConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

Lu Yuan

238

23 May 2023

Duplex Diffusion Models Improve Speech-to-Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xianchao Wu

DiffM

263

22 May 2023

MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting

183

19 May 2023

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

Jiatong Shi

168

12 May 2023

Enhancing Speech-to-Speech Translation with Multiple TTS TargetsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jiatong Shi

183

10 Apr 2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation ToolkitAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Jiatong Shi

...

278

10 Apr 2023

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

...

426

252

07 Mar 2023

NusaCrowd: Open Source Initiative for Indonesian NLP ResourcesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

...

536

19 Dec 2022

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

397

15 Dec 2022

VideoDubber: Machine Translation with Speech-Aware Length Control for Video DubbingAAAI Conference on Artificial Intelligence (AAAI), 2022

Junliang Guo

Jiang Bian

190

30 Nov 2022

Dialogs Re-enacted Across Languages

246

18 Nov 2022

Speech-to-Speech Translation For A Real-world Unwritten LanguageAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

...

378

11 Nov 2022

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech TranslationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Paul-Ambroise Duquenne

297

08 Nov 2022

Textless Direct Speech-to-Speech Translation with Discrete Speech RepresentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Xinjian Li

Ye Jia

Chung-Cheng Chiu

323

31 Oct 2022

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech TranslationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

214

31 Oct 2022

A Textless Metric for Speech-to-Speech Comparison

326

21 Oct 2022

Simple and Effective Unsupervised Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

275

18 Oct 2022