Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

5 July 2024
Ye Bai
Jingping Chen
Jitong Chen
Wei Chen
Zhuo Chen
Chuang Ding
Linhao Dong
Qianqian Dong
Yujiao Du
Kepan Gao
Lu Gao
Yi Guo
Minglun Han
Ting-Ting Han
Wenchao Hu
Xinying Hu
Yuxiang Hu
Deyu Hua
Lu Huang
Mingkun Huang
Youjia Huang
Jishuo Jin
Fanliu Kong
Zongwei Lan
Tianyu Li
Xiaoyang Li
Zeyang Li
Zehua Lin
Rui Liu
Shouda Liu
Lu Lu
Yizhou Lu
Jingting Ma
Shengtao Ma
Yulin Pei
Chen Shen
Tian Tan
Xiaogang Tian
Ming Tu
Bo Wang
Hao Wang
Yuping Wang
Yuxuan Wang
Hanzhang Xia
Rui Xia
Shuangyi Xie
Hongmin Xu
Meng Yang
Bihong Zhang
Jun Zhang
Wanyi Zhang
Yang Zhang
Yawei Zhang
Yijie Zheng
Ming Zou
    AuLLM
arXiv: 2407.04675

Papers citing "Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition"

8 / 8 papers shown
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations
Linrong Pan
Chenglong Jiang
Gaoze Hou
Ying Gao
38
0
0
08 May 2025
Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training
Yijie Zheng
Bangjun Xiao
Lei Shi
Xiaoyang Li
Faming Wu
Tianyu Li
Xuefeng Xiao
Y. Zhang
Y. Wang
Shouda Liu
MLLM
MoE
50
1
0
31 Mar 2025
HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
Bingshen Mu
Kun Wei
Qijie Shao
Yong Xu
Lei Xie
MoE
31
1
0
30 Sep 2024
Language Model Can Listen While Speaking
Ziyang Ma
Yakun Song
Chenpeng Du
Jian Cong
Zhuo Chen
Yuping Wang
Y. Wang
Xie Chen
AuLLM
29
23
0
05 Aug 2024
Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Xuelong Geng
Tianyi Xu
Kun Wei
Bingshen Mu
Hongfei Xue
...
Pengcheng Guo
Yuhang Dai
Longhao Li
Mingchen Shao
Lei Xie
36
9
0
03 May 2024
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Yu Zhang
Wei Han
James Qin
Yongqiang Wang
Ankur Bapna
...
Pedro J. Moreno
Chung-Cheng Chiu
J. Schalkwyk
Françoise Beaufays
Yonghui Wu
VLM
77
249
0
02 Mar 2023
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Alexis Conneau
Min Ma
Simran Khanuja
Yu Zhang
Vera Axelrod
Siddharth Dalmia
Jason Riesa
Clara E. Rivera
Ankur Bapna
VLM
78
281
0
25 May 2022
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020