ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.12838
  4. Cited By
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker
  Verification

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

22 May 2023
Yafeng Chen
Siqi Zheng
Haibo Wang
Luyao Cheng
Qian Chen
Jiajun Qi
ArXivPDFHTML

Papers citing "An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification"

21 / 21 papers shown
Title
MGFF-TDNN: A Multi-Granularity Feature Fusion TDNN Model with Depth-Wise Separable Module for Speaker Verification
MGFF-TDNN: A Multi-Granularity Feature Fusion TDNN Model with Depth-Wise Separable Module for Speaker Verification
Ya Li
Bin Zhou
Bo Hu
140
0
0
06 May 2025
A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition
A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition
W. R. Huang
Shumeng Sun
Junpeng Lu
Zhenpeng Xu
Zhengyang Xiu
Hao Zhang
24
0
0
17 Apr 2025
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing
Tianchi Liu
Duc-Tuan Truong
Rohan Kumar Das
K. Lee
Haizhou Li
31
0
0
08 Apr 2025
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
Sungwoo Cho
J. Choi
Sungnyun Kim
Se-Young Yun
63
0
0
14 Mar 2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
Ailin Huang
Boyong Wu
Bruce Wang
Chao Yan
Chen Hu
...
Tianyu Wang
Wenjin Deng
Wuxun Xie
Weipeng Ming
Wenqing He
AuLLM
77
9
0
17 Feb 2025
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer
Zhengyan Sheng
Zhihao Du
Shiliang Zhang
Zhijie Yan
Yexin Yang
Zhenhua Ling
51
1
0
16 Feb 2025
The First VoicePrivacy Attacker Challenge Evaluation Plan
The First VoicePrivacy Attacker Challenge Evaluation Plan
N. Tomashenko
Xiaoxiao Miao
Emmanuel Vincent
Junichi Yamagishi
125
2
0
09 Oct 2024
Improving Speaker Representations Using Contrastive Losses on
  Multi-scale Features
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features
Satvik Dixit
Massa Baali
Rita Singh
Bhiksha Raj
24
0
0
07 Oct 2024
Disentangling Age and Identity with a Mutual Information Minimization
  Approach for Cross-Age Speaker Verification
Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Fengrun Zhang
Wangjin Zhou
Yiming Liu
Wang Geng
Yahui Shan
Chen Zhang
26
0
0
24 Sep 2024
Towards Quantifying and Reducing Language Mismatch Effects in
  Cross-Lingual Speech Anti-Spoofing
Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing
Tianchi Liu
Ivan Kukanov
Zihan Pan
Qiongqiong Wang
Hardik B. Sailor
K. Lee
37
2
0
12 Sep 2024
Integrating Audio, Visual, and Semantic Information for Enhanced
  Multimodal Speaker Diarization
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization
Luyao Cheng
Hui Wang
Siqi Zheng
Yafeng Chen
Rongjie Huang
Qinglin Zhang
Qian Chen
Xihao Li
33
1
0
22 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
37
4
0
21 Jul 2024
FunAudioLLM: Voice Understanding and Generation Foundation Models for
  Natural Interaction Between Humans and LLMs
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
Keyu An
Qian Chen
Chong Deng
Zhihao Du
Changfeng Gao
...
Bin Zhang
Qinglin Zhang
Shiliang Zhang
Nan Zhao
Siqi Zheng
AuLLM
29
44
0
04 Jul 2024
GMM-ResNext: Combining Generative and Discriminative Models for Speaker
  Verification
GMM-ResNext: Combining Generative and Discriminative Models for Speaker Verification
Hui Yan
Zhenchun Lei
Changhong Liu
Yong Zhou
21
2
0
03 Jul 2024
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation
  for Embedding Undetectable Vulnerabilities on Speech Recognition
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition
Wenhan Yao
Jiangkun Yang
yongqiang He
Jia Liu
Weiping Wen
44
1
0
16 Jun 2024
Certification of Speaker Recognition Models to Additive Perturbations
Certification of Speaker Recognition Models to Additive Perturbations
Dmitrii Korzh
Elvir Karimov
Mikhail Aleksandrovich Pautov
Oleg Y. Rogov
Ivan V. Oseledets
50
1
0
29 Apr 2024
A Comparison of Differential Performance Metrics for the Evaluation of
  Automatic Speaker Verification Fairness
A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness
Oubaïda Chouchane
Christoph Busch
Chiara Galdi
Nicholas W. D. Evans
Massimiliano Todisco
29
1
0
27 Apr 2024
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker
  Verification
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
Tianchi Liu
Kong Aik Lee
Qiongqiong Wang
Haizhou Li
VLM
68
13
0
06 Dec 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition
  Challenge 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023
Mengjie Du
Xiang Fang
Jie Li
29
0
0
16 Aug 2023
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and
  Multi-Dialect Corpus for Speech Representation Disentanglement
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement
Siqi Zheng
Luyao Cheng
Yafeng Chen
Haibo Wang
Qian Chen
14
16
0
27 Jun 2023
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
1