Title
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition Binbin Zhang Hang Lv Pengcheng Guo Qijie Shao Chao Yang ... Hui Bu Xiaoyu Chen Chenchen Zeng Di Wu Zhendong Peng 321 279 0 07 Oct 2021
Back from the future: bidirectional CTC decoding using future information in speech recognition Namkyu Jung Geon-min Kim Han-Gyu Kim 207 3 0 07 Oct 2021
BERT Attends the Conversation: Improving Low-Resource Conversational ASR Pablo Ortiz Simen Burud 116 5 0 05 Oct 2021
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems J. C. Duarte S. Colcher 54 4 0 04 Oct 2021
Adversarial Regression with Doubly Non-negative Weighting Matrices Tam Le Truyen V. Nguyen M. Yamada Jose H. Blanchet Viet Anh Nguyen 185 5 0 30 Sep 2021
VoxCeleb Enrichment for Age and Gender Recognition Khaled Hechmi Trung Ngo Trong Ville Hautamaki Tomi Kinnunen 168 37 0 28 Sep 2021
DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning Tongan Cai Haomiao Ni Ming-Chieh Yu Xiaolei Huang K. Wong John Volpi Chao Guo Stephen T. C. Wong 143 25 0 24 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition Guolin Zheng Yubei Xiao Ke Gong Pan Zhou Xiaodan Liang Liang Lin 170 27 0 19 Sep 2021
Enforcing fairness in private federated learning via the modified method of differential multipliers Borja Rodríguez Gálvez Filip Granqvist Rogier van Dalen M. Seigel FedML 199 58 0 17 Sep 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers Desh Raj Liang Lu Zhuo Chen Yashesh Gaur Jinyu Li 107 19 0 17 Sep 2021
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription Chen Zhang Jiaxing Yu Luchin Chang Xu Tan Jiawei Chen Tao Qin Kecheng Zhang 127 16 0 16 Sep 2021
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition Felix Wu Kwangyoun Kim Jing Pan Kyu Jeong Han Kilian Q. Weinberger Yoav Artzi 157 82 0 14 Sep 2021
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing Deep Neural Networks for Wearables IEEE Internet of Things Journal (IEEE IoT Journal), 2021 B. Prabakaran Asima Akhtar Semeen Rehman Osman Hasan Mohamed Bennai 97 11 0 07 Sep 2021
SEC4SR: A Security Analysis Platform for Speaker Recognition Guangke Chen Zhe Zhao Fu Song Sen Chen Lingling Fan Yang Liu AAML 139 13 0 04 Sep 2021
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory ACM Multimedia (ACM MM), 2021 Zhijie Lin Zhou Zhao Haoyuan Li Jinglin Liu Meng Zhang Xingshan Zeng Xiaofei He 121 18 0 31 Aug 2021
Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring International Conference on Information and Knowledge Management (CIKM), 2021 Yaman Kumar Singla Avykat Gupta Shaurya Bagga Changyou Chen Balaji Krishnamurthy R. Shah 164 15 0 30 Aug 2021
CrossedWires: A Dataset of Syntactically Equivalent but Semantically Disparate Deep Learning Models Max Zvyagin Thomas Brettin Arvind Ramanathan Sumit Kumar Jha 104 1 0 29 Aug 2021
Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers Interspeech (Interspeech), 2021 Juntae Kim Jee-Hye Lee 178 8 0 22 Aug 2021
Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition Neural Processing Letters (NPL), 2021 Arash Dehghani Seyyed Ali Seyyedsalehi 195 1 0 09 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021 Chen Sun Shenggui Li Jinyue Wang Jun Yu 156 51 0 08 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification Interspeech (Interspeech), 2021 Yiding Jiang Bidisha Sharma Maulik C. Madhavi Haizhou Li 157 29 0 05 Aug 2021
Imperceptible Adversarial Examples by Spatial Chroma-Shift A. Aydin Deniz Sen Berat Tuna Karli Oguz Hanoglu A. Temizel AAML 137 18 0 05 Aug 2021
Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-Temporal Sparsity IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021 Chang Gao T. Delbruck Shih-Chii Liu 273 52 0 04 Aug 2021
Transformer-based Map Matching Model with Limited Ground-Truth Data using Transfer-Learning Approach Zhixiong Jin Jiwon Kim H. Yeo Seongjin Choi 203 33 0 01 Aug 2021
The History of Speech Recognition to the Year 2030 Awni Y. Hannun AI4TS 205 24 0 30 Jul 2021
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 International Conference on Computational Processing of the Portuguese Language (PROPOR), 2021 L. Gris Edresson Casanova F. S. Oliveira A. S. Soares A. Júnior 148 22 0 23 Jul 2021
Semantic Communications for Speech Recognition Global Communications Conference (GLOBECOM), 2021 Zhenzi Weng Zhijin Qin Geoffrey Ye Li 145 40 0 22 Jul 2021
CREW: Computation Reuse and Efficient Weight Storage for Hardware-accelerated MLPs and RNNs Journal of systems architecture (JSA), 2021 Marc Riera J. Arnau Antonio González 70 5 0 20 Jul 2021
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems Anirudh Sreeram Nicholas Mehlman Raghuveer Peri D. Knox Shrikanth Narayanan 81 6 0 12 Jul 2021
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data K. Cheuk Dorien Herremans Li Su 329 39 0 11 Jul 2021
Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers Huahuan Zheng Wenjie Peng Zhijian Ou Jinsong Zhang 197 5 0 07 Jul 2021
ARM-Net: Adaptive Relation Modeling Network for Structured Data Shaofeng Cai Kaiping Zheng Gang Chen H. V. Jagadish Beng Chin Ooi Meihui Zhang 245 61 0 05 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription Nikita Pavlichenko Ivan Stelmakh Dmitry Ustalov 130 20 0 02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python Yiheng Wang Yao Zhang Yanzhang Wang Yan Wan Jiao Wang Zhongyuan Wu Yuhao Yang Bowen She 398 111 0 01 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis Shammur A. Chowdhury Nadir Durrani Ahmed M. Ali 332 20 0 01 Jul 2021
Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis Conference on Computer and Communications Security (CCS), 2021 Chuanpu Fu Qi Li Meng Shen Ke Xu AAML 148 198 0 28 Jun 2021
Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition Interspeech (Interspeech), 2021 Jianrong Wang Zi-yue Tang Xuewei Li Mei Yu Qiang Fang Li Liu BDL 115 17 0 25 Jun 2021
Where are we in semantic concept extraction for Spoken Language Understanding? Sahar Ghannay Antoine Caubrière Salima Mdhaffar G. Laperriere Bassam Jabaian Yannick Esteve 175 18 0 24 Jun 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training Neural Information Processing Systems (NeurIPS), 2021 Anup Sarma Sonali Singh Huaipan Jiang Rui Zhang M. Kandemir Chita R. Das 69 1 0 22 Jun 2021
Silent Speech and Emotion Recognition from Vocal Tract Shape Dynamics in Real-Time MRI Laxmi Pandey A. Arif 101 8 0 16 Jun 2021
Exploiting Large-scale Teacher-Student Training for On-device Acoustic Models Workshop on Time-Delay Systems (TS), 2021 Jing Liu Rupak Vignesh Swaminathan S. Parthasarathi Chunchuan Lyu Athanasios Mouchtaris Siegfried Kunzmann 119 9 0 11 Jun 2021
U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition Di Wu Binbin Zhang Chao Yang Zhendong Peng Wenjing Xia Xiaoyu Chen X. Lei 178 55 0 10 Jun 2021
Unsupervised Automatic Speech Recognition: A Review Speech Communication (Speech Commun.), 2021 Hanan Aldarmaki Asad Ullah Nazar Zaki VLM SSL 131 65 0 09 Jun 2021
Handcrafted Backdoors in Deep Neural Networks Neural Information Processing Systems (NeurIPS), 2021 Sanghyun Hong Nicholas Carlini Alexey Kurakin 216 87 0 08 Jun 2021
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation International Conference on Machine Learning (ICML), 2021 Dong Min Dong Bok Lee Eunho Yang Sung Ju Hwang 302 206 0 06 Jun 2021
Escaping Saddle Points Faster with Stochastic Momentum International Conference on Learning Representations (ICLR), 2020 Jun-Kun Wang Chi-Heng Lin Jacob D. Abernethy ODL 171 24 0 05 Jun 2021
Bottom-up and top-down approaches for the design of neuromorphic processing systems: Tradeoffs and synergies between natural and artificial intelligence Proceedings of the IEEE (Proc. IEEE), 2021 Charlotte Frenkel D. Bol Giacomo Indiveri 233 56 0 02 Jun 2021
A Generalizable Approach to Learning Optimizers Diogo Almeida Clemens Winter Jie Tang Wojciech Zaremba AI4CE 250 33 0 02 Jun 2021
A Sum-of-Ratios Multi-Dimensional-Knapsack Decomposition for DNN Resource Scheduling IEEE Conference on Computer Communications (INFOCOM), 2021 Menglu Yu Chuan Wu Bo Ji Jia Liu 113 10 0 28 May 2021
End-to-End Deep Fault Tolerant Control IEEE/ASME transactions on mechatronics (IEEE/ASME Trans. Mechatronics), 2021 Daulet Baimukashev Bexultan Rakhim Matteo Rubagotti H. A. Varol 106 13 0 28 May 2021