ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.07048
  4. Cited By
Progressive Joint Modeling in Unsupervised Single-channel Overlapped
  Speech Recognition
v1v2 (latest)

Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition

21 July 2017
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
ArXiv (abs)PDFHTML

Papers citing "Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition"

31 / 31 papers shown
Title
Advancing Multi-talker ASR Performance with Large Language Models
Advancing Multi-talker ASR Performance with Large Language Models
Mohan Shi
Zengrui Jin
Yaoxun Xu
Yong Xu
Shi-Xiong Zhang
Kun Wei
Yiwen Shao
Chunlei Zhang
Dong Yu
74
2
0
30 Aug 2024
A Glance is Enough: Extract Target Sentence By Looking at A keyword
A Glance is Enough: Extract Target Sentence By Looking at A keyword
Ying Shi
Dong Wang
Lantian Li
Jiqing Han
94
1
0
09 Oct 2023
SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Yangze Li
Fan Yu
Yuhao Liang
Pengcheng Guo
Mohan Shi
Zhihao Du
Shiliang Zhang
Lei Xie
44
4
0
07 Oct 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
CASA-ASR: Context-Aware Speaker-Attributed ASR
Mohan Shi
Zhihao Du
Qian Chen
Fan Yu
Yangze Li
Shiliang Zhang
Jie Zhang
Lirong Dai
58
9
0
21 May 2023
Deep Transfer Learning for Automatic Speech Recognition: Towards Better
  Generalization
Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Hamza Kheddar
Yassine Himeur
S. Al-Maadeed
Abbes Amira
F. Bensaali
148
84
0
27 Apr 2023
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech
  Recognition in Multi-party Meetings
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings
Mohan Shi
Jie Zhang
Zhihao Du
Fan Yu
Qian Chen
Shiliang Zhang
Lirong Dai
81
4
0
01 Nov 2022
MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in
  Multi-party meeting scenario
MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yuhao Liang
Zhihao Du
Yuxiao Lin
Linfu Xie
61
12
0
11 Oct 2022
Feature Learning and Ensemble Pre-Tasks Based Self-Supervised Speech
  Denoising and Dereverberation
Feature Learning and Ensemble Pre-Tasks Based Self-Supervised Speech Denoising and Dereverberation
Yi Li
ShuangLin Li
Yang Sun
S. M. Naqvi
38
0
0
10 Jun 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition
  in Multi-party Meetings
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
42
15
0
31 Mar 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting
  Transcription Grand Challenge
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
...
Kong Aik Lee
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
59
28
0
08 Feb 2022
Speaker conditioning of acoustic models using affine transformation for
  multi-speaker speech recognition
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Midia Yousefi
John H.L. Hanse
28
5
0
30 Oct 2021
An End-to-end Architecture of Online Multi-channel Speech Separation
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Jianwei Yu
Shi-Xiong Zhang
Jian Wu
Shahram Ghorbani
Bo Wu
Shiyin Kang
Shansong Liu
Xunying Liu
Helen Meng
Dong Yu
85
73
0
06 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled
  BLSTM for Single-channel Multi-talker Speech Separation
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
15
8
0
25 Dec 2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural
  Dialogue Recordings with Target-Speaker Acoustic Models
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
Naoyuki Kanda
Shota Horiguchi
Yusuke Fujita
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
58
36
0
17 Sep 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and
  Transfer Learning
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
55
27
0
13 Aug 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech
  Recognition
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
68
34
0
26 Jun 2019
Unsupervised training of a deep clustering model for multichannel blind
  source separation
Unsupervised training of a deep clustering model for multichannel blind source separation
Lukas Drude
Daniel Hasenklever
Reinhold Häb-Umbach
SSL
69
58
0
02 Apr 2019
End-to-end Anchored Speech Recognition
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
72
20
0
06 Feb 2019
End-to-end contextual speech recognition using class language models and
  a token passing decoder
End-to-end contextual speech recognition using class language models and a token passing decoder
Zhehuai Chen
Mahaveer Jain
Yongqiang Wang
M. Seltzer
Christian Fuegen
79
54
0
05 Dec 2018
A Comparison of Lattice-free Discriminative Training Criteria for Purely
  Sequence-Trained Neural Network Acoustic Models
A Comparison of Lattice-free Discriminative Training Criteria for Purely Sequence-Trained Neural Network Acoustic Models
Chao Weng
Manway Liu
62
5
0
08 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
87
77
0
05 Nov 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation
  Approach Using Neural Networks
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
84
82
0
08 Oct 2018
Linguistic Search Optimization for Deep Learning Based LVCSR
Linguistic Search Optimization for Deep Learning Based LVCSR
Zhehuai Chen
39
1
0
02 Aug 2018
Sequence Discriminative Training for Deep Learning based Acoustic
  Keyword Spotting
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting
Zhehuai Chen
Y. Qian
Kai Yu
52
20
0
02 Aug 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel
  Speech Mixtures
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
93
91
0
24 Jul 2018
A Purely End-to-end System for Multi-speaker Speech Recognition
A Purely End-to-end System for Multi-speaker Speech Recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
J. Hershey
54
89
0
15 May 2018
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition
Shuai Wang
Zili Huang
Y. Qian
Kai Yu
27
8
0
03 May 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
77
160
0
25 Apr 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
68
97
0
13 Mar 2018
Single-Channel Multi-talker Speech Recognition with Permutation
  Invariant Training
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training
Y. Qian
Xuankai Chang
Dong Yu
54
79
0
19 Jul 2017
1