An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

22 May 2023

Siqi Zheng

Qian Chen

Papers citing "An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification"

21 / 21 papers shown

Title
MGFF-TDNN: A Multi-Granularity Feature Fusion TDNN Model with Depth-Wise Separable Module for Speaker Verification Ya Li Bin Zhou Bo Hu 140 0 0 06 May 2025
A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition W. R. Huang Shumeng Sun Junpeng Lu Zhenpeng Xu Zhengyang Xiu Hao Zhang 24 0 0 17 Apr 2025
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing Tianchi Liu Duc-Tuan Truong Rohan Kumar Das K. Lee Haizhou Li 31 0 0 08 Apr 2025
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation Sungwoo Cho J. Choi Sungnyun Kim Se-Young Yun 63 0 0 14 Mar 2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction Ailin Huang Boyong Wu Bruce Wang Chao Yan Chen Hu ... Tianyu Wang Wenjin Deng Wuxun Xie Weipeng Ming Wenqing He AuLLM 77 9 0 17 Feb 2025
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer Zhengyan Sheng Zhihao Du Shiliang Zhang Zhijie Yan Yexin Yang Zhenhua Ling 51 1 0 16 Feb 2025
The First VoicePrivacy Attacker Challenge Evaluation Plan N. Tomashenko Xiaoxiao Miao Emmanuel Vincent Junichi Yamagishi 125 2 0 09 Oct 2024
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features Satvik Dixit Massa Baali Rita Singh Bhiksha Raj 24 0 0 07 Oct 2024
Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification Fengrun Zhang Wangjin Zhou Yiming Liu Wang Geng Yahui Shan Chen Zhang 26 0 0 24 Sep 2024
Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing Tianchi Liu Ivan Kukanov Zihan Pan Qiongqiong Wang Hardik B. Sailor K. Lee 37 2 0 12 Sep 2024
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization Luyao Cheng Hui Wang Siqi Zheng Yafeng Chen Rongjie Huang Qinglin Zhang Qian Chen Xihao Li 33 1 0 22 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning Shuai Wang Zheng-Shou Chen Kong Aik Lee Yan-min Qian Haizhou Li 37 4 0 21 Jul 2024
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Keyu An Qian Chen Chong Deng Zhihao Du Changfeng Gao ... Bin Zhang Qinglin Zhang Shiliang Zhang Nan Zhao Siqi Zheng AuLLM 29 44 0 04 Jul 2024
GMM-ResNext: Combining Generative and Discriminative Models for Speaker Verification Hui Yan Zhenchun Lei Changhong Liu Yong Zhou 21 2 0 03 Jul 2024
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition Wenhan Yao Jiangkun Yang yongqiang He Jia Liu Weiping Wen 44 1 0 16 Jun 2024
Certification of Speaker Recognition Models to Additive Perturbations Dmitrii Korzh Elvir Karimov Mikhail Aleksandrovich Pautov Oleg Y. Rogov Ivan V. Oseledets 50 1 0 29 Apr 2024
A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness Oubaïda Chouchane Christoph Busch Chiara Galdi Nicholas W. D. Evans Massimiliano Todisco 29 1 0 27 Apr 2024
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification Tianchi Liu Kong Aik Lee Qiongqiong Wang Haizhou Li VLM 68 13 0 06 Dec 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023 Mengjie Du Xiang Fang Jie Li 29 0 0 16 Aug 2023
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement Siqi Zheng Luyao Cheng Yafeng Chen Haibo Wang Qian Chen 14 16 0 27 Jun 2023
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 224 2,234 0 14 Jun 2018