Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.07048
Cited By
v1
v2 (latest)
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
21 July 2017
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition"
31 / 31 papers shown
Title
Advancing Multi-talker ASR Performance with Large Language Models
Mohan Shi
Zengrui Jin
Yaoxun Xu
Yong Xu
Shi-Xiong Zhang
Kun Wei
Yiwen Shao
Chunlei Zhang
Dong Yu
74
2
0
30 Aug 2024
A Glance is Enough: Extract Target Sentence By Looking at A keyword
Ying Shi
Dong Wang
Lantian Li
Jiqing Han
94
1
0
09 Oct 2023
SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Yangze Li
Fan Yu
Yuhao Liang
Pengcheng Guo
Mohan Shi
Zhihao Du
Shiliang Zhang
Lei Xie
44
4
0
07 Oct 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
Mohan Shi
Zhihao Du
Qian Chen
Fan Yu
Yangze Li
Shiliang Zhang
Jie Zhang
Lirong Dai
58
9
0
21 May 2023
Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Hamza Kheddar
Yassine Himeur
S. Al-Maadeed
Abbes Amira
F. Bensaali
148
84
0
27 Apr 2023
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings
Mohan Shi
Jie Zhang
Zhihao Du
Fan Yu
Qian Chen
Shiliang Zhang
Lirong Dai
81
4
0
01 Nov 2022
MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yuhao Liang
Zhihao Du
Yuxiao Lin
Linfu Xie
61
12
0
11 Oct 2022
Feature Learning and Ensemble Pre-Tasks Based Self-Supervised Speech Denoising and Dereverberation
Yi Li
ShuangLin Li
Yang Sun
S. M. Naqvi
38
0
0
10 Jun 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
42
15
0
31 Mar 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
...
Kong Aik Lee
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
59
28
0
08 Feb 2022
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Midia Yousefi
John H.L. Hanse
28
5
0
30 Oct 2021
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Jianwei Yu
Shi-Xiong Zhang
Jian Wu
Shahram Ghorbani
Bo Wu
Shiyin Kang
Shansong Liu
Xunying Liu
Helen Meng
Dong Yu
85
73
0
06 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
15
8
0
25 Dec 2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
Naoyuki Kanda
Shota Horiguchi
Yusuke Fujita
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
58
36
0
17 Sep 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
55
27
0
13 Aug 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
68
34
0
26 Jun 2019
Unsupervised training of a deep clustering model for multichannel blind source separation
Lukas Drude
Daniel Hasenklever
Reinhold Häb-Umbach
SSL
69
58
0
02 Apr 2019
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
72
20
0
06 Feb 2019
End-to-end contextual speech recognition using class language models and a token passing decoder
Zhehuai Chen
Mahaveer Jain
Yongqiang Wang
M. Seltzer
Christian Fuegen
79
54
0
05 Dec 2018
A Comparison of Lattice-free Discriminative Training Criteria for Purely Sequence-Trained Neural Network Acoustic Models
Chao Weng
Manway Liu
62
5
0
08 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
87
77
0
05 Nov 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
84
82
0
08 Oct 2018
Linguistic Search Optimization for Deep Learning Based LVCSR
Zhehuai Chen
39
1
0
02 Aug 2018
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting
Zhehuai Chen
Y. Qian
Kai Yu
52
20
0
02 Aug 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
93
91
0
24 Jul 2018
A Purely End-to-end System for Multi-speaker Speech Recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
J. Hershey
54
89
0
15 May 2018
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition
Shuai Wang
Zili Huang
Y. Qian
Kai Yu
27
8
0
03 May 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
77
160
0
25 Apr 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
68
97
0
13 Mar 2018
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training
Y. Qian
Xuankai Chang
Dong Yu
54
79
0
19 Jul 2017
1