2302.14036
Cited By
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
27 February 2023
Vladimir Bataev
Roman Korostik
Evgeny Shabalin
Vitaly Lavrukhin
Boris Ginsburg
VLM
Papers citing "Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator"
12 / 12 papers shown
Title
Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning
Yingyi Ma
Zhe Liu
Ozlem Kalinli
65
0
0
09 Dec 2024
AMPS: ASR with Multimodal Paraphrase Supervision
Amruta Parulekar
Abhishek Gupta
Sameep Chattopadhyay
P. Jyothi
75
0
0
27 Nov 2024
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM
Jiawei Yu
Y. Li
Xiaosong Qiao
Huan Zhao
Xiaofeng Zhao
Wei Tang
M. Zhang
Hao Yang
Jinsong Su
68
0
0
20 Nov 2024
Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR
Abhishek Gupta
Amruta Parulekar
Sameep Chattopadhyay
P. Jyothi
VLM
26
0
0
17 Oct 2024
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition
Hsuan Su
Hua Farn
Fan-Yun Sun
Shang-Tse Chen
Hung-yi Lee
MoMe
24
2
0
05 Jun 2024
Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization
Alexandra Antonova
26
0
0
29 Sep 2023
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models
Hsuan Su
Ting-Yao Hu
H. Koppula
Raviteja Vemulapalli
Jen-Hao Rick Chang
Karren D. Yang
G. Mantena
Oncel Tuzel
SyDa
20
1
0
18 Sep 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Shaan Bijwadia
Shuo-yiin Chang
Weiran Wang
Zhong Meng
Hao Zhang
Tara N. Sainath
6
1
0
14 Aug 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
Lu Huang
B. Li
Jun Zhang
Lu Lu
Zejun Ma
14
2
0
07 Jun 2023
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Ting-Yao Hu
Mohammadreza Armandpour
A. Shrivastava
Jen-Hao Rick Chang
H. Koppula
Oncel Tuzel
SyDa
44
42
0
21 Oct 2021
CTC Variations Through New WFST Topologies
A. Laptev
Somshubra Majumdar
Boris Ginsburg
19
20
0
06 Oct 2021
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
177
287
0
14 Sep 2019