ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14838
  4. Cited By
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text
  Translation
v1v2 (latest)

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

Neural Information Processing Systems (NeurIPS), 2023
24 May 2023
Chenyang Le
Yao Qian
Long Zhou
Shujie Liu
Yanmin Qian
Michael Zeng
Xuedong Huang
ArXiv (abs)PDFHTML

Papers citing "ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation"

8 / 8 papers shown
Title
Whisper-UT: A Unified Translation Framework for Speech and Text
Whisper-UT: A Unified Translation Framework for Speech and Text
Cihan Xiao
Matthew Wiesner
Debashish Chakraborty
Reno Kriz
Keith Cunningham
Kenton W. Murray
Kevin Duh
Luis Tavarez-Arce
Paul McNamee
Sanjeev Khudanpur
80
0
0
19 Sep 2025
Novel Parasitic Dual-Scale Modeling for Efficient and Accurate Multilingual Speech Translation
Novel Parasitic Dual-Scale Modeling for Efficient and Accurate Multilingual Speech Translation
Chenyang Le
Yinfeng Xia
Huiyan Li
Manhong Wang
Yutao Sun
Xingyang Ma
Yanmin Qian
60
0
0
15 Aug 2025
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness
  via Noise Representation Learning
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation LearningSIGDIAL Conferences (SIGDIAL), 2024
Wonjun Lee
San Kim
Gary Geunbae Lee
227
0
0
12 Aug 2024
Investigating Decoder-only Large Language Models for Speech-to-text
  Translation
Investigating Decoder-only Large Language Models for Speech-to-text Translation
Chao-Wei Huang
Hui Lu
Hongyu Gong
Hirofumi Inaguma
Ilia Kulikov
Ruslan Mavlyutov
Sravya Popuri
AuLLMLRM
191
13
0
03 Jul 2024
A Mel Spectrogram Enhancement Paradigm Based on CWT in Speech Synthesis
A Mel Spectrogram Enhancement Paradigm Based on CWT in Speech Synthesis
Guoqiang Hu
Huaning Tan
Ruilai Li
251
6
0
18 Jun 2024
TransVIP: Speech to Speech Translation System with Voice and Isochrony
  Preservation
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation
Chenyang Le
Yao Qian
Dongmei Wang
Long Zhou
Shujie Liu
...
Midia Yousefi
Yanmin Qian
Jinyu Li
Sheng Zhao
Michael Zeng
218
12
0
28 May 2024
GenTranslate: Large Language Models are Generative Multilingual Speech
  and Machine Translators
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine TranslatorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Dong Zhang
Zhehuai Chen
Eng Siong Chng
224
34
0
10 Feb 2024
Sparks of Large Audio Models: A Survey and Outlook
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Xiaoshi Zhong
Björn W. Schuller
LM&MAAuLLM
577
51
0
24 Aug 2023
1