Speaker Recognition from Raw Waveform with SincNet

29 July 2018

Mirco Ravanelli

Papers citing "Speaker Recognition from Raw Waveform with SincNet"

50 / 259 papers shown

Title
Optimization of data-driven filterbank for automatic speaker verification S. K. Sarangi Md. Sahidullah G. Saha 17 38 0 21 Jul 2020
Memory based fusion for multi-modal deep learning Darshana Priyasad Tharindu Fernando Simon Denman S. Sridharan Clinton Fookes 4 0 0 16 Jul 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 17 9 0 14 Jun 2020
Uniphore's submission to Fearless Steps Challenge Phase-2 Karthik Pandia C. Spera 11 0 0 10 Jun 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning Sameer Khurana Antoine Laurent James R. Glass SSL 6 12 0 04 Jun 2020
SNR-Based Teachers-Student Technique for Speech Enhancement Xiang Hao Xiangdong Su Zhiyu Wang Qiang Zhang Huali Xu Guanglai Gao 9 15 0 29 May 2020
End-to-End Auditory Object Recognition via Inception Nucleus M. K. Ebrahimpour Timothy M. Shea Andreea Danielescu D. Noelle Christopher T. Kello 6 8 0 25 May 2020
Identify Speakers in Cocktail Parties with End-to-End Attention Junzhe Zhu M. Hasegawa-Johnson Leda Sari 6 2 0 22 May 2020
Active Speakers in Context Juan Carlos León Alcázar Fabian Caba Heilbron Long Mai Federico Perazzi Joon-Young Lee Pablo Arbelaez Bernard Ghanem 12 61 0 20 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation Won Ik Cho Donghyun Kwak J. Yoon N. Kim 18 26 0 17 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 9 151 0 08 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms Seung-bin Kim Jee-weon Jung Hye-jin Shim Ju-ho Kim Ha-Jin Yu 6 5 0 07 May 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective M. S. Saeed Shah Nawaz Pietro Morerio Arif Mahmood I. Gallo Muhammad Haroon Yousaf Alessio Del Bue CVBM 12 25 0 28 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech Hyeong-Seok Choi Changdae Park Kyogu Lee CVBM 6 29 0 13 Apr 2020
Learning to fool the speaker recognition Jiguo Li Xinfeng Zhang Jizheng Xu Li Zhang Y. Wang Siwei Ma Wen Gao AAML 25 21 0 07 Apr 2020
Universal Adversarial Perturbations Generative Network for Speaker Recognition Jiguo Li Xinfeng Zhang Chuanmin Jia Jizheng Xu Li Zhang Y. Wang Siwei Ma Wen Gao AAML 6 44 0 07 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion Mayank Tripathi Divyanshu Singh Seba Susan 12 7 0 05 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms Jee-weon Jung Seung-bin Kim Hye-jin Shim Ju-ho Kim Ha-Jin Yu 8 60 0 01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition João Antônio Chagas Nunes David Macêdo Cleber Zanchettin 12 22 0 31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification Juan Manuel Coria H. Bredin Sahar Ghannay S. Rosset 12 15 0 31 Mar 2020
In defence of metric learning for speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun Minjae Lee Hee-Soo Heo Soyeon Choe Chiheon Ham Sung-Ye Jung Bong-Jin Lee Icksang Han 12 430 0 26 Mar 2020
Speaker Identification using EEG G. Krishna Co Tran Mason Carnahan Ahmed H. Tewfik 6 0 0 07 Mar 2020
CGCNN: Complex Gabor Convolutional Neural Network on raw speech Paul-Gauthier Noé Titouan Parcollet Mohamed Morchid 14 29 0 11 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends S. Latif R. Rana Sara Khalifa Raja Jurdak Junaid Qadir Björn W. Schuller AI4TS 21 81 0 02 Jan 2020
Large-scale Multi-modal Person Identification in Real Unconstrained Environments Jiajie Ye Y. Guan Junfa Liu Xinghong Huang Hong Zhang 10 1 0 17 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019 Leibny Paola García-Perera Jesus Villalba H. Bredin Jun Du Diego Castán ... Wassim Bouaziz Hadrien Titeux Emmanuel Dupoux Kong Aik Lee Najim Dehak 6 29 0 02 Dec 2019
Deep learning methods in speaker recognition: a review Dávid Sztahó György Szaszák A. Beke VLM 16 46 0 14 Nov 2019
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial Intelligent Diagnosis Tianfu Li Zhibin Zhao Chuang Sun Li Cheng Xuefeng Chen Ruqaing Yan R. Gao 11 313 0 12 Nov 2019
Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions Simon Mittermaier Ludwig Kurzinger Bernd Waschneck Gerhard Rigoll 6 57 0 05 Nov 2019
pyannote.audio: neural building blocks for speaker diarization H. Bredin Ruiqing Yin Juan Manuel Coria G. Gelly Pavel Korshunov Marvin Lavechin D. Fustes Hadrien Titeux Wassim Bouaziz Marie-Philippe Gill 183 310 0 04 Nov 2019
Sum-Product Networks for Robust Automatic Speaker Identification Aaron Nicolson K. Paliwal TPM 17 1 0 26 Oct 2019
Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection Latané Bullock H. Bredin Leibny Paola García-Perera 14 94 0 25 Oct 2019
Filterbank design for end-to-end speech separation Manuel Pariente Samuele Cornell Antoine Deleforge Emmanuel Vincent 14 69 0 23 Oct 2019
Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms K. M. Koerich M. Esmailpour Sajjad Abdoli A. Britto Alessandro Lameiras Koerich AAML 22 1 0 22 Oct 2019
Acoustic Model Adaptation from Raw Waveforms with SincNet Joachim Fainberg Ondˇrej Klejch Erfan Loweimi P. Bell Steve Renals 4 14 0 30 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks Changle Liu Sze-Wei Fu You-Jin Li Jen-Wei Huang Hsin-Min Wang Yu Tsao 11 49 0 26 Sep 2019
Understanding Semantics from Speech Through Pre-training P. Wang Liangchen Wei Yong Cao Jinghui Xie Yuji Cao Zaiqing Nie SSL VLM 6 6 0 24 Sep 2019
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis Xin Wang Junichi Yamagishi 6 31 0 27 Aug 2019
Universal Adversarial Audio Perturbations Sajjad Abdoli L. G. Hafemann Jérôme Rony Ismail Ben Ayed P. Cardinal Alessandro Lameiras Koerich AAML 22 51 0 08 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models Slawomir Kapka M. Lewandowski 11 65 0 02 Aug 2019
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge Hossein Zeinali Themos Stafylakis Georgia Athanasopoulou Johan Rohdin Ioannis Gkinis L. Burget J. Černocký 21 65 0 13 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition S. Latif R. Rana Sara Khalifa Raja Jurdak J. Epps Björn W. Schuller 31 99 0 13 Jul 2019
Towards Explainable Music Emotion Recognition: The Route via Mid-level Features Shreyan Chowdhury Andreu Vall Verena Haunschmid Gerhard Widmer 14 35 0 08 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification Youngmoon Jung Younggwan Kim Hyungjun Lim Yeunju Choi Hoirin Kim 13 32 0 19 Jun 2019
Deep Learning for Audio Signal Processing Hendrik Purwins Bo-wen Li Tuomas Virtanen Jan Schlüter Shuo-yiin Chang Tara N. Sainath VLM 19 578 0 30 Apr 2019
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping N. Alamdari A. Azarang N. Kehtarnavaz 11 42 0 26 Apr 2019
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network Sajjad Abdoli P. Cardinal Alessandro Lameiras Koerich 24 268 0 18 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification Jee-weon Jung Hee-Soo Heo Ju-ho Kim Hye-jin Shim Ha-Jin Yu 13 138 0 17 Apr 2019
Audio-Visual Model Distillation Using Acoustic Images Andrés F. Pérez Valentina Sanguineti Pietro Morerio Vittorio Murino VLM 8 27 0 16 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Gene-Ping Yang Chao-I Tuan Hung-yi Lee Lin-Shan Lee 17 25 0 16 Apr 2019