Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.00158
Cited By
Speaker Recognition from Raw Waveform with SincNet
29 July 2018
Mirco Ravanelli
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speaker Recognition from Raw Waveform with SincNet"
50 / 259 papers shown
Title
Optimization of data-driven filterbank for automatic speaker verification
S. K. Sarangi
Md. Sahidullah
G. Saha
17
38
0
21 Jul 2020
Memory based fusion for multi-modal deep learning
Darshana Priyasad
Tharindu Fernando
Simon Denman
S. Sridharan
Clinton Fookes
4
0
0
16 Jul 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
17
9
0
14 Jun 2020
Uniphore's submission to Fearless Steps Challenge Phase-2
Karthik Pandia
C. Spera
11
0
0
10 Jun 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning
Sameer Khurana
Antoine Laurent
James R. Glass
SSL
6
12
0
04 Jun 2020
SNR-Based Teachers-Student Technique for Speech Enhancement
Xiang Hao
Xiangdong Su
Zhiyu Wang
Qiang Zhang
Huali Xu
Guanglai Gao
9
15
0
29 May 2020
End-to-End Auditory Object Recognition via Inception Nucleus
M. K. Ebrahimpour
Timothy M. Shea
Andreea Danielescu
D. Noelle
Christopher T. Kello
6
8
0
25 May 2020
Identify Speakers in Cocktail Parties with End-to-End Attention
Junzhe Zhu
M. Hasegawa-Johnson
Leda Sari
6
2
0
22 May 2020
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Bernard Ghanem
12
61
0
20 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho
Donghyun Kwak
J. Yoon
N. Kim
18
26
0
17 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
9
151
0
08 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
6
5
0
07 May 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
12
25
0
28 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Hyeong-Seok Choi
Changdae Park
Kyogu Lee
CVBM
6
29
0
13 Apr 2020
Learning to fool the speaker recognition
Jiguo Li
Xinfeng Zhang
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
25
21
0
07 Apr 2020
Universal Adversarial Perturbations Generative Network for Speaker Recognition
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
6
44
0
07 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion
Mayank Tripathi
Divyanshu Singh
Seba Susan
12
7
0
05 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
8
60
0
01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition
João Antônio Chagas Nunes
David Macêdo
Cleber Zanchettin
12
22
0
31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification
Juan Manuel Coria
H. Bredin
Sahar Ghannay
S. Rosset
12
15
0
31 Mar 2020
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
12
430
0
26 Mar 2020
Speaker Identification using EEG
G. Krishna
Co Tran
Mason Carnahan
Ahmed H. Tewfik
6
0
0
07 Mar 2020
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
Paul-Gauthier Noé
Titouan Parcollet
Mohamed Morchid
14
29
0
11 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
21
81
0
02 Jan 2020
Large-scale Multi-modal Person Identification in Real Unconstrained Environments
Jiajie Ye
Y. Guan
Junfa Liu
Xinghong Huang
Hong Zhang
10
1
0
17 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Leibny Paola García-Perera
Jesus Villalba
H. Bredin
Jun Du
Diego Castán
...
Wassim Bouaziz
Hadrien Titeux
Emmanuel Dupoux
Kong Aik Lee
Najim Dehak
6
29
0
02 Dec 2019
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
16
46
0
14 Nov 2019
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial Intelligent Diagnosis
Tianfu Li
Zhibin Zhao
Chuang Sun
Li Cheng
Xuefeng Chen
Ruqaing Yan
R. Gao
11
313
0
12 Nov 2019
Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions
Simon Mittermaier
Ludwig Kurzinger
Bernd Waschneck
Gerhard Rigoll
6
57
0
05 Nov 2019
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
183
310
0
04 Nov 2019
Sum-Product Networks for Robust Automatic Speaker Identification
Aaron Nicolson
K. Paliwal
TPM
17
1
0
26 Oct 2019
Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection
Latané Bullock
H. Bredin
Leibny Paola García-Perera
14
94
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
14
69
0
23 Oct 2019
Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms
K. M. Koerich
M. Esmailpour
Sajjad Abdoli
A. Britto
Alessandro Lameiras Koerich
AAML
22
1
0
22 Oct 2019
Acoustic Model Adaptation from Raw Waveforms with SincNet
Joachim Fainberg
Ondˇrej Klejch
Erfan Loweimi
P. Bell
Steve Renals
4
14
0
30 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks
Changle Liu
Sze-Wei Fu
You-Jin Li
Jen-Wei Huang
Hsin-Min Wang
Yu Tsao
11
49
0
26 Sep 2019
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSL
VLM
6
6
0
24 Sep 2019
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis
Xin Wang
Junichi Yamagishi
6
31
0
27 Aug 2019
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
22
51
0
08 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
11
65
0
02 Aug 2019
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Hossein Zeinali
Themos Stafylakis
Georgia Athanasopoulou
Johan Rohdin
Ioannis Gkinis
L. Burget
J. Černocký
21
65
0
13 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
31
99
0
13 Jul 2019
Towards Explainable Music Emotion Recognition: The Route via Mid-level Features
Shreyan Chowdhury
Andreu Vall
Verena Haunschmid
Gerhard Widmer
14
35
0
08 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
13
32
0
19 Jun 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
19
578
0
30 Apr 2019
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
N. Alamdari
A. Azarang
N. Kehtarnavaz
11
42
0
26 Apr 2019
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network
Sajjad Abdoli
P. Cardinal
Alessandro Lameiras Koerich
24
268
0
18 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
13
138
0
17 Apr 2019
Audio-Visual Model Distillation Using Acoustic Images
Andrés F. Pérez
Valentina Sanguineti
Pietro Morerio
Vittorio Murino
VLM
8
27
0
16 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
17
25
0
16 Apr 2019
Previous
1
2
3
4
5
6
Next