What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

4 November 2019

Papers citing "What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis"

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

David Gimeno-Gómez

Carlos David Martínez Hinarejos

421

09 Jul 2024

Exploration of Adapter for Noise Robust Automatic Speech Recognition

Hao Shi

Tatsuya Kawahara

276

28 Feb 2024

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Interspeech, 2023

Jiamin Xie

John H. L. Hansen

156

27 Oct 2023

Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition

International Conference on Artificial Neural Networks (ICANN), 2023

Patrick Eickhoff

M. Möller

Theresa Pekarek-Rosin

Johannes Twiefel

Stefan Wermter

151

05 Sep 2023

Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

Jia Pan

163

27 Jun 2023

Hardware Acceleration of Explainable Artificial Intelligence

Zhixin Pan

Prabhat Mishra

290

04 May 2023

A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Lingwei Meng

196

20 Feb 2023

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Yuxuan Wang

222

27 Oct 2022

Privacy-Preserving Speech Representation Learning using Vector Quantization

15 Mar 2022

Towards Relatable Explainable AI with the Perceptual Process

International Conference on Human Factors in Computing Systems (CHI), 2021

Wencan Zhang

Brian Y. Lim

AAML XAI

268

28 Dec 2021

Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition

170

07 Nov 2021

Focus on the present: a regularization method for the ASR source-target attention layer

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Nanxin Chen

Piotr Żelasko

Jesús Villalba

Najim Dehak

200

02 Nov 2020

Probing Acoustic Representations for Phonetic Properties

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Danni Ma

Neville Ryant

M. Liberman

345

25 Oct 2020

TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech

IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

537

394

12 Jul 2020

Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation

324

159

18 May 2020

Utterance-level Aggregation For Speaker Recognition In The Wild

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Weidi Xie

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

237

364

26 Feb 2019