Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.00526
Cited By
v1
v2 (latest)
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Interspeech (Interspeech), 2020
1 April 2020
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms"
29 / 29 papers shown
ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild
Jiangyan Yi
Chu Yuan Zhang
Jianhua Tao
Chenglong Wang
Xinrui Yan
Yong Ren
Hao Gu
Junzuo Zhou
324
15
0
09 Aug 2024
Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection
Lian Huang
Chi-Man Pun
220
14
0
11 Jan 2024
Malafide: a novel adversarial convolutive noise attack against deepfake and spoofing detection systems
Interspeech (Interspeech), 2023
Michele Panariello
W. Ge
Hemlata Tak
Massimiliano Todisco
Nicholas W. D. Evans
AAML
249
22
0
13 Jun 2023
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection
Interspeech (Interspeech), 2023
Chenglong Wang
Jiangyan Yi
Jianhua Tao
Chuyuan Zhang
Shuai Zhang
Ruibo Fu
Xun Chen
231
15
0
23 May 2023
Speaker-Aware Anti-Spoofing
Interspeech (Interspeech), 2023
Xuechen Liu
Md. Sahidullah
Kong Aik Lee
Tomi Kinnunen
262
4
0
02 Mar 2023
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Kwangje Baeg
Yeong-Gwan Kim
Youngsub Han
Byoung-Ki Jeon
180
0
0
22 Jan 2023
Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning
Interspeech (Interspeech), 2022
John H. L. Hansen
Zhenyu Wang
316
19
0
17 Nov 2022
SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection
Pattern Recognition (Pattern Recogn.), 2022
Jiangyan Yi
Chenglong Wang
Jianhua Tao
Chu Yuan Zhang
Cunhang Fan
Zhengkun Tian
Haoxin Ma
Ruibo Fu
270
28
0
11 Nov 2022
EmoFake: An Initial Dataset for Emotion Fake Audio Detection
China National Conference on Chinese Computational Linguistics (CCL), 2022
Yan Zhao
Jiangyan Yi
Jianhua Tao
Chenglong Wang
Xiaohui Zhang
Yongfeng Dong
217
24
0
10 Nov 2022
Individualized Conditioning and Negative Distances for Speaker Separation
International Conference on Machine Learning and Applications (ICMLA), 2022
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
220
1
0
12 Oct 2022
SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection
International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), 2022
Piotr Kawa
Marcin Plata
P. Syga
269
27
0
12 Oct 2022
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Xuechen Liu
Xin Wang
Md. Sahidullah
J. Patino
Héctor Delgado
...
Massimiliano Todisco
Junichi Yamagishi
Nicholas W. D. Evans
A. Nautsch
Kong Aik Lee
409
331
0
05 Oct 2022
The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022
Sangwon Suh
Sunjong Park
203
2
0
21 Sep 2022
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Nan Zhang
Jianzong Wang
Zhenhou Hong
Chendong Zhao
Xiaoyang Qu
Jing Xiao
269
5
0
26 May 2022
Improved Relation Networks for End-to-End Speaker Verification and Identification
Interspeech (Interspeech), 2022
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
172
4
0
31 Mar 2022
Pushing the limits of raw waveform speaker recognition
Interspeech (Interspeech), 2022
Jee-weon Jung
You Jin Kim
Hee-Soo Heo
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
253
126
0
16 Mar 2022
ICASSP 2022 Deep Noise Suppression Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Harishchandra Dubey
Vishak Gopal
Ross Cutler
Chandan K. A. Reddy
Sergiy Matusevych
...
Sefik Emre Eskimez
Manthan Thakker
Sriram Srinivasan
H. Gamper
R. Aichner
391
229
0
27 Feb 2022
ADD 2022: the First Audio Deep Synthesis Detection Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiangyan Yi
Ruibo Fu
Jianhua Tao
Shuai Nie
Haoxin Ma
...
Le Xu
Zhengqi Wen
Haizhou Li
Zheng Lian
Bin Liu
347
257
0
17 Feb 2022
Graph attentive feature aggregation for text-independent speaker verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Hye-jin Shim
Ju-Sung Heo
Jae-han Park
Gareth Lee
Ha-Jin Yu
241
18
0
23 Dec 2021
RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing
Hemlata Tak
Madhu R. Kamble
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
291
191
0
08 Nov 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schonherr
DiffM
427
203
0
04 Nov 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ge Zhu
Frank Cwitkowitz
Z. Duan
288
3
0
08 Oct 2021
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Junichi Yamagishi
Xin Wang
Massimiliano Todisco
Md. Sahidullah
J. Patino
...
Xuechen Liu
Kong Aik Lee
Tomi Kinnunen
Nicholas W. D. Evans
Héctor Delgado
242
491
0
01 Sep 2021
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection
Hemlata Tak
Jee-weon Jung
J. Patino
Madhu R. Kamble
Massimiliano Todisco
Nicholas W. D. Evans
331
232
0
27 Jul 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System
Ju-ho Kim
Hye-jin Shim
Jee-weon Jung
Ha-Jin Yu
231
1
0
14 Apr 2021
Graph Attention Networks for Anti-Spoofing
Interspeech (Interspeech), 2021
Hemlata Tak
Jee-weon Jung
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
270
90
0
08 Apr 2021
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
Interspeech (Interspeech), 2020
Ge Zhu
Fei Jiang
Z. Duan
307
26
0
24 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
222
39
0
22 Oct 2020
Utterance-level Aggregation For Speaker Recognition In The Wild
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
324
366
0
26 Feb 2019
1
Page 1 of 1