2310.06434
Cited By
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
10 October 2023
S. Radhakrishnan
Chao-Han Huck Yang
S. Khan
Rohit Kumar
N. Kiani
D. Gómez-Cabrero
Jesper N. Tegnér
Papers citing "Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition"
10 / 10 papers shown
Title
Can Large Language Models Understand Spatial Audio?
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
...
Jun Zhang
Lu Lu
Zejun Ma
Yuxuan Wang
Chao Zhang
31
4
0
12 Jun 2024
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Yuanchao Li
Peter Bell
Catherine Lai
36
9
0
12 Jun 2024
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Mingjie Chen
Hezhao Zhang
Yuanchao Li
Jiachen Luo
Wen Wu
...
Lin Wang
P. Woodland
Xie Chen
Huy P Phan
Thomas Hain
18
0
0
30 May 2024
Crossmodal ASR Error Correction with Discrete Speech Units
Yuanchao Li
Pinzhen Chen
Peter Bell
Catherine Lai
23
6
0
26 May 2024
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
AuLLM
43
2
0
24 May 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen
Ruizhe Li
Yuchen Hu
Sabato Marco Siniscalchi
Pin-Yu Chen
Ensiong Chng
Chao-Han Huck Yang
24
19
0
08 Feb 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Chao Zhang
Pin-Yu Chen
Ensiong Chng
15
20
0
19 Jan 2024
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT
Liyan Xu
Yile Gu
J. Kolehmainen
Haidar Khan
Ankur Gandhe
Ariya Rastrow
A. Stolcke
I. Bulyko
25
45
0
02 Feb 2022
ASR Rescoring and Confidence Estimation with ELECTRA
Hayato Futami
H. Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
KELM
51
20
0
05 Oct 2021
Domain-aware Neural Language Models for Speech Recognition
Linda Liu
Yile Gu
Aditya Gourav
Ankur Gandhe
Shashank Kalmane
Denis Filimonov
Ariya Rastrow
I. Bulyko
20
21
0
05 Jan 2021