ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.02499
  4. Cited By
A Weakly-Supervised Streaming Multilingual Speech Model with Truly
  Zero-Shot Capability

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

4 November 2022
Jian Xue
Peidong Wang
Jinyu Li
Eric Sun
ArXivPDFHTML

Papers citing "A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability"

10 / 10 papers shown
Title
CTC-GMM: CTC guided modality matching for fast and accurate streaming
  speech translation
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation
Rui Zhao
Jinyu Li
Ruchao Fan
Matt Post
36
1
0
07 Oct 2024
Target word activity detector: An approach to obtain ASR word boundaries
  without lexicon
Target word activity detector: An approach to obtain ASR word boundaries without lexicon
S. Sivasankaran
Eric Sun
Jinyu Li
Yan-ping Huang
Jing Pan
30
0
0
20 Sep 2024
Soft Language Identification for Language-Agnostic Many-to-One
  End-to-End Speech Translation
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation
Peidong Wang
Jian Xue
Jinyu Li
Junkun Chen
Aswin Shanmugam Subramanian
20
0
0
12 Jun 2024
Improving Stability in Simultaneous Speech Translation: A
  Revision-Controllable Decoding Approach
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach
Junkun Chen
Jian Xue
Peidong Wang
Jing Pan
Jinyu Li
11
2
0
06 Oct 2023
DiariST: Streaming Speech Translation with Speaker Diarization
DiariST: Streaming Speech Translation with Speaker Diarization
Muqiao Yang
Naoyuki Kanda
Xiaofei Wang
Junkun Chen
Peidong Wang
Jian Xue
Jinyu Li
Takuya Yoshioka
11
6
0
14 Sep 2023
On decoder-only architecture for speech-to-text and large language model
  integration
On decoder-only architecture for speech-to-text and large language model integration
Jian Wu
Yashesh Gaur
Zhuo Chen
Long Zhou
Yilun Zhu
...
Jinyu Li
Shujie Liu
Bo Ren
Linquan Liu
Yu-Huan Wu
AuLLM
22
117
0
08 Jul 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST
  Leveraging Textual Alignments
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments
Sara Papi
Peidong Wan
Junkun Chen
Jian Xue
Jinyu Li
Yashesh Gaur
21
8
0
07 Jul 2023
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text
  Translation
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Chenyang Le
Yao Qian
Long Zhou
Shujie Liu
Yanmin Qian
Michael Zeng
Xuedong Huang
17
12
0
24 May 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot
  Task Generalization
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Puyuan Peng
Brian Yan
Shinji Watanabe
David F. Harwath
VLM
LRM
30
46
0
18 May 2023
A Configurable Multilingual Model is All You Need to Recognize All
  Languages
A Configurable Multilingual Model is All You Need to Recognize All Languages
Long Zhou
Jinyu Li
Eric Sun
Shujie Liu
92
40
0
13 Jul 2021
1