ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.00158
  4. Cited By
Speaker Recognition from Raw Waveform with SincNet

Speaker Recognition from Raw Waveform with SincNet

29 July 2018
Mirco Ravanelli
Yoshua Bengio
ArXivPDFHTML

Papers citing "Speaker Recognition from Raw Waveform with SincNet"

50 / 259 papers shown
Title
Optimization of data-driven filterbank for automatic speaker
  verification
Optimization of data-driven filterbank for automatic speaker verification
S. K. Sarangi
Md. Sahidullah
G. Saha
17
38
0
21 Jul 2020
Memory based fusion for multi-modal deep learning
Memory based fusion for multi-modal deep learning
Darshana Priyasad
Tharindu Fernando
Simon Denman
S. Sridharan
Clinton Fookes
4
0
0
16 Jul 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6
  Challenge
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
17
9
0
14 Jun 2020
Uniphore's submission to Fearless Steps Challenge Phase-2
Uniphore's submission to Fearless Steps Challenge Phase-2
Karthik Pandia
C. Spera
11
0
0
10 Jun 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised
  Speech Representation Learning
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning
Sameer Khurana
Antoine Laurent
James R. Glass
SSL
6
12
0
04 Jun 2020
SNR-Based Teachers-Student Technique for Speech Enhancement
SNR-Based Teachers-Student Technique for Speech Enhancement
Xiang Hao
Xiangdong Su
Zhiyu Wang
Qiang Zhang
Huali Xu
Guanglai Gao
9
15
0
29 May 2020
End-to-End Auditory Object Recognition via Inception Nucleus
End-to-End Auditory Object Recognition via Inception Nucleus
M. K. Ebrahimpour
Timothy M. Shea
Andreea Danielescu
D. Noelle
Christopher T. Kello
6
8
0
25 May 2020
Identify Speakers in Cocktail Parties with End-to-End Attention
Identify Speakers in Cocktail Parties with End-to-End Attention
Junzhe Zhu
M. Hasegawa-Johnson
Leda Sari
6
2
0
22 May 2020
Active Speakers in Context
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Bernard Ghanem
12
61
0
20 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho
Donghyun Kwak
J. Yoon
N. Kim
18
26
0
17 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
9
151
0
08 May 2020
Segment Aggregation for short utterances speaker verification using raw
  waveforms
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
6
5
0
07 May 2020
Cross-modal Speaker Verification and Recognition: A Multilingual
  Perspective
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
12
25
0
28 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised
  Generation of Human Face from Speech
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Hyeong-Seok Choi
Changdae Park
Kyogu Lee
CVBM
6
29
0
13 Apr 2020
Learning to fool the speaker recognition
Learning to fool the speaker recognition
Jiguo Li
Xinfeng Zhang
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
25
21
0
07 Apr 2020
Universal Adversarial Perturbations Generative Network for Speaker
  Recognition
Universal Adversarial Perturbations Generative Network for Speaker Recognition
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
6
44
0
07 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion
Speaker Recognition using SincNet and X-Vector Fusion
Mayank Tripathi
Divyanshu Singh
Seba Susan
12
7
0
05 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker
  Verification using Raw Waveforms
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
8
60
0
01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition
AM-MobileNet1D: A Portable Model for Speaker Recognition
João Antônio Chagas Nunes
David Macêdo
Cleber Zanchettin
12
22
0
31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker
  Verification
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification
Juan Manuel Coria
H. Bredin
Sahar Ghannay
S. Rosset
12
15
0
31 Mar 2020
In defence of metric learning for speaker recognition
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
12
430
0
26 Mar 2020
Speaker Identification using EEG
Speaker Identification using EEG
G. Krishna
Co Tran
Mason Carnahan
Ahmed H. Tewfik
6
0
0
07 Mar 2020
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
Paul-Gauthier Noé
Titouan Parcollet
Mohamed Morchid
14
29
0
11 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
21
81
0
02 Jan 2020
Large-scale Multi-modal Person Identification in Real Unconstrained
  Environments
Large-scale Multi-modal Person Identification in Real Unconstrained Environments
Jiajie Ye
Y. Guan
Junfa Liu
Xinghong Huang
Hong Zhang
10
1
0
17 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Leibny Paola García-Perera
Jesus Villalba
H. Bredin
Jun Du
Diego Castán
...
Wassim Bouaziz
Hadrien Titeux
Emmanuel Dupoux
Kong Aik Lee
Najim Dehak
6
29
0
02 Dec 2019
Deep learning methods in speaker recognition: a review
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
16
46
0
14 Nov 2019
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial
  Intelligent Diagnosis
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial Intelligent Diagnosis
Tianfu Li
Zhibin Zhao
Chuang Sun
Li Cheng
Xuefeng Chen
Ruqaing Yan
R. Gao
11
313
0
12 Nov 2019
Small-Footprint Keyword Spotting on Raw Audio Data with
  Sinc-Convolutions
Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions
Simon Mittermaier
Ludwig Kurzinger
Bernd Waschneck
Gerhard Rigoll
6
57
0
05 Nov 2019
pyannote.audio: neural building blocks for speaker diarization
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
183
310
0
04 Nov 2019
Sum-Product Networks for Robust Automatic Speaker Identification
Sum-Product Networks for Robust Automatic Speaker Identification
Aaron Nicolson
K. Paliwal
TPM
17
1
0
26 Oct 2019
Overlap-aware diarization: resegmentation using neural end-to-end
  overlapped speech detection
Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection
Latané Bullock
H. Bredin
Leibny Paola García-Perera
14
94
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
14
69
0
23 Oct 2019
Cross-Representation Transferability of Adversarial Attacks: From
  Spectrograms to Audio Waveforms
Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms
K. M. Koerich
M. Esmailpour
Sajjad Abdoli
A. Britto
Alessandro Lameiras Koerich
AAML
22
1
0
22 Oct 2019
Acoustic Model Adaptation from Raw Waveforms with SincNet
Acoustic Model Adaptation from Raw Waveforms with SincNet
Joachim Fainberg
Ondˇrej Klejch
Erfan Loweimi
P. Bell
Steve Renals
4
14
0
30 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully
  Convolutional Networks
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks
Changle Liu
Sze-Wei Fu
You-Jin Li
Jen-Wei Huang
Hsin-Min Wang
Yu Tsao
11
49
0
26 Sep 2019
Understanding Semantics from Speech Through Pre-training
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSL
VLM
6
6
0
24 Sep 2019
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice
  Frequency for Text-to-Speech Synthesis
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis
Xin Wang
Junichi Yamagishi
6
31
0
27 Aug 2019
Universal Adversarial Audio Perturbations
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
22
51
0
08 Aug 2019
Sound source detection, localization and classification using
  consecutive ensemble of CRNN models
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
11
65
0
02 Aug 2019
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission
  to ASVspoof 2019 Challenge
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Hossein Zeinali
Themos Stafylakis
Georgia Athanasopoulou
Johan Rohdin
Ioannis Gkinis
L. Burget
J. Černocký
21
65
0
13 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion
  Recognition
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
31
99
0
13 Jul 2019
Towards Explainable Music Emotion Recognition: The Route via Mid-level
  Features
Towards Explainable Music Emotion Recognition: The Route via Mid-level Features
Shreyan Chowdhury
Andreu Vall
Verena Haunschmid
Gerhard Widmer
14
35
0
08 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for
  Text-Independent Speaker Verification
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
13
32
0
19 Jun 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
19
578
0
30 Apr 2019
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
N. Alamdari
A. Azarang
N. Kehtarnavaz
11
42
0
26 Apr 2019
End-to-End Environmental Sound Classification using a 1D Convolutional
  Neural Network
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network
Sajjad Abdoli
P. Cardinal
Alessandro Lameiras Koerich
24
268
0
18 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for
  text-independent speaker verification
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
13
138
0
17 Apr 2019
Audio-Visual Model Distillation Using Acoustic Images
Audio-Visual Model Distillation Using Acoustic Images
Andrés F. Pérez
Valentina Sanguineti
Pietro Morerio
Vittorio Murino
VLM
8
27
0
16 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint
  Embedding and Clustering
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
17
25
0
16 Apr 2019
Previous
123456
Next