Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.09618
Cited By
Multi-Modal Retrieval For Large Language Model Based Speech Recognition
13 June 2024
J. Kolehmainen
Aditya Gourav
Prashanth Gurunath Shivakumar
Yile Gu
Ankur Gandhe
Ariya Rastrow
Grant P. Strimel
I. Bulyko
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-Modal Retrieval For Large Language Model Based Speech Recognition"
6 / 6 papers shown
Title
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Junxiao Xue
Quan Deng
Fei Yu
Yanhao Wang
Jun Wang
Y. Li
VLM
43
3
0
31 Dec 2024
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
44
0
0
01 Nov 2024
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
Shanbo Cheng
Zhichao Huang
Tom Ko
Hang Li
Ningxin Peng
Lu Xu
Qini Zhang
48
3
0
31 Jul 2024
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
26
7
0
06 Mar 2023
Training Language Models with Memory Augmentation
Zexuan Zhong
Tao Lei
Danqi Chen
RALM
232
127
0
25 May 2022
Domain-aware Neural Language Models for Speech Recognition
Linda Liu
Yile Gu
Aditya Gourav
Ankur Gandhe
Shashank Kalmane
Denis Filimonov
Ariya Rastrow
I. Bulyko
28
21
0
05 Jan 2021
1