2107.09321
A Real-time Speaker Diarization System Based on Spatial Spectrum
20 July 2021
Siqi Zheng
Weilong Huang
Xianliang Wang
Hongbin Suo
Jinwei Feng
Zhijie Yan
Papers citing
"A Real-time Speaker Diarization System Based on Spatial Spectrum"
16 / 16 papers shown
Title
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization
Luyao Cheng
Hui Wang
Siqi Zheng
Yafeng Chen
Rongjie Huang
Qinglin Zhang
Qian Chen
Xihao Li
33
1
0
22 Aug 2024
The Neural-SRP method for positional sound source localization
Eric Grinstein
Toon van Waterschoot
Mike Brookes
Patrick A. Naylor
26
2
0
14 Mar 2024
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement
Siqi Zheng
Luyao Cheng
Yafeng Chen
Haibo Wang
Qian Chen
27
18
0
27 Jun 2023
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
Yuhao Liang
Fan Yu
Yangze Li
Pengcheng Guo
Shiliang Zhang
Qian Chen
Linfu Xie
33
8
0
23 May 2023
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization
Luyao Cheng
Siqi Zheng
Zhang Qinglin
Haibo Wang
Yafeng Chen
Qian Chen
43
4
0
22 May 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
Mohan Shi
Zhihao Du
Qian Chen
Fan Yu
Yangze Li
Shiliang Zhang
Jie Zhang
Lirong Dai
36
8
0
21 May 2023
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yuhao Liang
Zhihao Du
Yuxiao Lin
Linfu Xie
33
11
0
11 Oct 2022
Iterative Sound Source Localization for Unknown Number of Sources
Yanjie Fu
Meng Ge
Haoran Yin
Xinyuan Qian
Longbiao Wang
Gaoyan Zhang
J. Dang
34
8
0
24 Jun 2022
PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Siqi Zheng
Hongbin Suo
Qian Chen
35
4
0
16 May 2022
Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure
Siqi Zheng
Hongbin Suo
28
7
0
26 Apr 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
24
13
0
31 Mar 2022
The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Chen Shen
Yi Y. Liu
Wenzhi Fan
Bin Wang
Shi-Xue Wen
Yao Tian
Jun Zhang
Jingsheng Yang
Zejun Ma
17
4
0
09 Feb 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
...
Kong Aik Lee
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
18
28
0
08 Feb 2022
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information
Zhihao Du
Shiliang Zhang
Siqi Zheng
Weilong Huang
Ming Lei
24
1
0
28 Nov 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
11
106
0
14 Oct 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
37
7
0
09 Sep 2021