Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.00158
Cited By
Speaker Recognition from Raw Waveform with SincNet
29 July 2018
Mirco Ravanelli
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speaker Recognition from Raw Waveform with SincNet"
50 / 259 papers shown
Title
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Yang Liu
AAML
17
18
0
07 Jun 2022
Radar Image Reconstruction from Raw ADC Data using Parametric Variational Autoencoder with Domain Adaptation
Michael Stephan
Thomas Stadelmayer
Avik Santra
Georg Fischer0001
R. Weigel
F. Lurz
8
10
0
30 May 2022
Adversarial attacks and defenses in Speaker Recognition Systems: A survey
Jiahe Lan
Rui Zhang
Zheng Yan
Jie Wang
Yu Chen
Ronghui Hou
AAML
9
23
0
27 May 2022
Trainable Wavelet Neural Network for Non-Stationary Signals
Jason Stock
Chuck Anderson
9
3
0
06 May 2022
Dictionary Attacks on Speaker Verification
Mirko Marras
Pawel Korus
Anubhav Jain
N. Memon
AAML
13
9
0
24 Apr 2022
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Ryandhimas E. Zezario
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
16
16
0
07 Apr 2022
HiFi-VC: High Quality ASR-Based Voice Conversion
A. Kashkin
I. Karpukhin
S. Shishkin
14
5
0
31 Mar 2022
Does Audio Deepfake Detection Generalize?
Nicolas M. Muller
Pavel Czempin
Franziska Dieckmann
Adam Froghyar
Konstantin Böttinger
25
136
0
30 Mar 2022
Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Yikang Wang
Hiromitsu Nishizaki
17
1
0
30 Mar 2022
Learning neural audio features without supervision
Sarthak Yadav
Neil Zeghidour
SSL
30
4
0
29 Mar 2022
Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Cho-Ying Wu
Chin-Cheng Hsu
Ulrich Neumann
CVBM
4
14
0
18 Mar 2022
Pushing the limits of raw waveform speaker recognition
Jee-weon Jung
You Jin Kim
Hee-Soo Heo
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
23
87
0
16 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
22
106
0
02 Mar 2022
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
Hemlata Tak
Massimiliano Todisco
Xin Wang
Jee-weon Jung
Junichi Yamagishi
Nicholas W. D. Evans
19
151
0
24 Feb 2022
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery
Haibin Wu
Heng-Cheng Kuo
Naijun Zheng
Kuo-Hsuan Hung
Hung-yi Lee
Yu Tsao
Hsin-Min Wang
H. Meng
14
36
0
14 Feb 2022
The xmuspeech system for multi-channel multi-party meeting transcription challenge
Jie Wang
Yuji Liu
Binling Wang
Yiming Zhi
Song Li
Shipeng Xia
Jiayang Zhang
Lin Li
Q. Hong
Feng Tong
11
0
0
11 Feb 2022
Learnable Nonlinear Compression for Robust Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
17
2
0
10 Feb 2022
CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Vin Sachidananda
Shao-Yen Tseng
Erik Marchi
S. Kajarekar
P. Georgiou
21
8
0
08 Feb 2022
Learnable Wavelet Packet Transform for Data-Adapted Spectrograms
Gaetan Frusque
Olga Fink
9
13
0
26 Jan 2022
Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting
Kwanhyung Lee
Hyewon Jeong
Seyun Kim
Donghwa Yang
Hoon-Chul Kang
E. Choi
OOD
4
12
0
21 Jan 2022
A Practical Guide to Logical Access Voice Presentation Attack Detection
Xin Wang
Junichi Yamagishi
AAML
11
10
0
10 Jan 2022
Towards Relatable Explainable AI with the Perceptual Process
Wencan Zhang
Brian Y. Lim
AAML
XAI
9
61
0
28 Dec 2021
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Z. Tan
John H. L. Hansen
Jesper Jensen
11
99
0
20 Nov 2021
Investigating self-supervised front ends for speech spoofing countermeasures
Xin Wang
Junichi Yamagishi
AAML
17
123
0
15 Nov 2021
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
19
75
0
03 Nov 2021
A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches
Dongyue Guo
Jianwei Zhang
Bo Yang
Yi Lin
17
10
0
03 Nov 2021
FANS: Fusing ASR and NLU for on-device SLU
Martin H. Radfar
Athanasios Mouchtaris
Siegfried Kunzmann
Ariya Rastrow
17
12
0
31 Oct 2021
Deep Learning For Prominence Detection In Children's Read Speech
Mithilesh Vaidya
Kamini Sabu
Preeti Rao
6
6
0
27 Oct 2021
Optimizing Multi-Taper Features for Deep Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
16
1
0
21 Oct 2021
EEGminer: Discovering Interpretable Features of Brain Activity with Learnable Filters
Siegfried Ludwig
Stylianos Bakas
D. Adamos
N. Laskaris
Yannis Panagakis
S. Zafeiriou
8
6
0
19 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Andreas Triantafyllopoulos
U. Reichel
Shuo Liu
Simon Huber
F. Eyben
Björn W. Schuller
25
9
0
13 Oct 2021
Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Zhengyang Chen
Sanyuan Chen
Yu-Huan Wu
Yao Qian
Chengyi Wang
Shujie Liu
Y. Qian
Michael Zeng
SSL
15
124
0
12 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
Ge Zhu
Frank Cwitkowitz
Z. Duan
22
2
0
08 Oct 2021
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks
Jee-weon Jung
Hee-Soo Heo
Hemlata Tak
Hye-jin Shim
Joon Son Chung
Bong-Jin Lee
Ha-Jin Yu
Nicholas W. D. Evans
121
279
0
04 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
18
6
0
24 Sep 2021
MS-SincResNet: Joint learning of 1D and 2D kernels using multi-scale SincNet and ResNet for music genre classification
Pei-Chun Chang
Yonghao Chen
Chang-Hsing Lee
15
21
0
18 Sep 2021
Behavior of Keyword Spotting Networks Under Noisy Conditions
Anwesh Mohanty
Adrian Frischknecht
Christoph Gerum
Oliver Bringmann
6
1
0
15 Sep 2021
Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation
Juan Manuel Coria
H. Bredin
Sahar Ghannay
Sophie Rosset
36
30
0
14 Sep 2021
Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model
Zhongwei Teng
Quchen Fu
Jules White
Maria E. Powell
Douglas C. Schmidt
8
5
0
06 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
19
2
0
23 Aug 2021
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition
Krishna D N Freshworks
14
11
0
22 Aug 2021
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space
Labib Chowdhury
M. Kamal
Najia Hasan
Nabeel Mohammed
11
3
0
21 Aug 2021
On the Exploitability of Audio Machine Learning Pipelines to Surreptitious Adversarial Examples
Adelin Travers
Lorna Licollari
Guanghan Wang
Varun Chandrasekaran
Adam Dziedzic
David Lie
Nicolas Papernot
AAML
13
3
0
03 Aug 2021
A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations
Debottam Dutta
Purvi Agrawal
Sriram Ganapathy
6
2
0
30 Jul 2021
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection
Hemlata Tak
Jee-weon Jung
J. Patino
Madhu R. Kamble
Massimiliano Todisco
Nicholas W. D. Evans
11
157
0
27 Jul 2021
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds
Xuan Shi
Erica Cooper
Junichi Yamagishi
24
7
0
24 Jul 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
13
5
0
20 Jul 2021
Human Perception of Audio Deepfakes
Nicolas M. Muller
Karla Markert
Konstantin Böttinger
11
49
0
20 Jul 2021
PERSA+: A Deep Learning Front-End for Context-Agnostic Audio Classification
Lazaros Vrysis
Iordanis Thoidis
Charalampos A. Dimoulas
G. Papanikolaou
VLM
17
0
0
20 Jul 2021
Interpretable SincNet-based Deep Learning for Emotion Recognition from EEG brain activity
J. M. M. Torres
Mirco Ravanelli
Sara E. Medina-DeVilliers
M. Lerner
Giuseppe Riccardi
11
21
0
18 Jul 2021
Previous
1
2
3
4
5
6
Next