
End-to-End Speech Recognition From the Raw Waveform

19 June 2018

Papers citing "End-to-End Speech Recognition From the Raw Waveform"

34 / 34 papers shown

RepAugment: Input-Agnostic Representation-Level Augmentation for Respiratory Sound Classification

Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024

268

05 May 2024

Learning neural audio features without supervision

Interspeech, 2022

Sarthak Yadav

Neil Zeghidour

SSL

150

29 Mar 2022

Shennong: a Python toolbox for audio speech features extraction

216

10 Dec 2021

Deep Spoken Keyword Spotting: An Overview

IEEE Access, 2021

250

140

20 Nov 2021

Recent Advances in End-to-End Automatic Speech Recognition

APSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

509

443

02 Nov 2021

Beyond L_p clipping: Equalization-based Psychoacoustic Attacks against ASRs

H. Abdullah

Muhammad Sajidur Rahman

Christian Peeters

Cassidy Gibson

Washington Garcia

Vincent Bindschaedler

T. Shrimpton

Patrick Traynor

AAML

125

25 Oct 2021

Learning Sparse Analytic Filters for Piano Transcription

Frank Cwitkowitz

M. Heydari

Z. Duan

346

23 Aug 2021

Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition

Interspeech, 2021

162

08 Jun 2021

Interpreting intermediate convolutional layers of generative CNNs trained on waveforms

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Gašper Beguš

Alan Zhou

362

19 Apr 2021

End-to-end Audio-visual Speech Recognition with Conformers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Pingchuan Ma

Stavros Petridis

Maja Pantic

358

294

12 Feb 2021

LEAF: A Learnable Frontend for Audio Classification

International Conference on Learning Representations (ICLR), 2021

Neil Zeghidour

O. Teboul

Félix de Chaumont Quitry

Marco Tagliasacchi

VLM AAML

280

175

21 Jan 2021

Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer

IEEE International Joint Conference on Neural Networks (IJCNN), 2020

342

23 Nov 2020

Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions

325

15 Oct 2020

End-to-End Bengali Speech Recognition

S. Mandal

Sarthak Yadav

A. Rai

117

21 Sep 2020

Exploring Filterbank Learning for Keyword Spotting

European Signal Processing Conference (EUSIPCO), 2020

Iván López-Espejo

Zheng-Hua Tan

Jesper Jensen

210

30 May 2020

PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR

Sanjeev Khudanpur

218

20 May 2020

CGCNN: Complex Gabor Convolutional Neural Network on raw speech

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Paul-Gauthier Noé

Titouan Parcollet

Mohamed Morchid

166

11 Feb 2020

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019

Jingdong Li

Hui Zhang

Xueliang Zhang

Changliang Li

175

02 Feb 2020

Machine learning for music genre: multifaceted review and experimentation with audioset

Journal of Intelligence and Information Systems (JIIS), 2019

Jaime Ramírez

M. Flores

VLM

163

28 Nov 2019

Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

322

05 Nov 2019

Universal Adversarial Audio Perturbations

Alessandro Lameiras Koerich

AAML

476

08 Aug 2019

Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019

318

23 Jun 2019

End-to-End ASR for Code-switched Hindi-English Speech

131

22 Jun 2019

Multi-Stream End-to-End Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019

Ruizhi Li

Xiaofei Wang

Sri Harish Reddy Mallidi

Shinji Watanabe

Takaaki Hori

H. Hermansky

248

17 Jun 2019

End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network

Sajjad Abdoli

P. Cardinal

Alessandro Lameiras Koerich

221

307

18 Apr 2019

RawNet: Fast End-to-End Neural Vocoder

Yunchao He

Yujun Wang

205

10 Apr 2019

Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition

226

13 Mar 2019

Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition

294

124

22 Jan 2019

Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees

216

14 Jan 2019

Exploring spectro-temporal features in end-to-end convolutional neural networks

Sean Robertson

Gerald Penn

Yingxue Wang

191

01 Jan 2019

Fully Convolutional Speech Recognition

296

17 Dec 2018

Learning to detect dysarthria from raw speech

Juliette Millet

Neil Zeghidour

280

27 Nov 2018

Multi-encoder multi-resolution framework for end-to-end speech recognition

Ruizhi Li

Xiaofei Wang

Sri Harish Reddy Mallidi

Takaaki Hori

Shinji Watanabe

H. Hermansky

164

12 Nov 2018

Single-Microphone Speech Enhancement and Separation Using Deep Learning

Morten Kolbaek

247

31 Aug 2018